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12 Human Secreted Proteins 

Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
organelle, contains different proteins essential for the function of the organelle. The cell 
uses "sorting signals " which are amino acid motifs located within the protein, to target 
proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 
(ER). The ER separates the membrane-bounded proteins from all other types of proteins. 
Once localized to the ER, both groups of proteins can be further directed to another 
organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to vesicles, 
including secretory vesicles, the cell membrane, lysosomes, and the other organelles. 

Proteins targeted to the ER by a signal sequence can be released into the 
extracellular space as a secreted protein. For example, vesicles containing secreted 
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proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutive! y or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 
secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
5 membrane can also be secreted into the extracellular space by proteolytic cleavage of a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 
the commercially valuable human insulin, interferon, Factor VIII, human growth 
10 hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of the 

pervasive role of secreted proteins in human physiology, a need exists for identifying and 
characterizing novel human secreted proteins and the genes that encode them. This 
knowledge will allow one to detect, to treat, and to prevent medical disorders by using 
secreted proteins or the genes that encode them. 

15 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
20 and recombinant and synthetic methods for producing the polypeptides and 

polynucleotides. Also provided are diagnostic methods for detecting disorders and 
conditions related to the polypeptides and polynucleotides, and therapeutic methods for 
treating such disorders and conditions. The invention further relates to screening 
methods for identifying binding partners of the polypeptides. 

25 
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Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain terms 
5 used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered 
"by the hand of man" from its natural state. For example, an isolated polynucleotide 
could be part of a vector or a composition of matter, or could be contained within a cell, 
10 and still be "isolated" because that vector, composition of matter, or particular cell is not 
the original environment of the polynucleotide. The term "isolated" does not refer to 
genomic or cDNA libraries, whole cell total or mRNA preparations, genomic DNA 
preparations (including those separated by electrophoresis and transferred onto blots), 
sheared whole cell genomic DNA preparations or other compositions where the art 
15 demonstrates no distinguishing features of the polynucleotide/sequences of the present 
invention. 

In the present invention, a "secreted" protein refers to those proteins capable of 
being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
20 signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protein can undergo extracellular processing to produce 
a "mature" protein. Release into the extracellular space can occur by many mechanisms, 
including exocytosis and proteolytic cleavage. 

25 

In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least J 00, at least 125, at least 500, or at least 1000 continuous 
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nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5 
kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, polynucleotides of 
the invention comprise a portion of the coding sequences, as disclosed herein, but do not 
comprise all or a portion of any intron. In another embodiment, the polynucleotides 
5 comprising coding sequences do not contain coding sequences of a genomic flanking 
gene (i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 
500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 

10 sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence of 
the full length cDNA sequence, including the 5* and 3' untranslated sequences, the coding 
region, with or without the signal sequence, the secreted protein coding region, as well as 
fragments, epitopes, domains, and variants of the nucleic acid sequence. Moreover, as 

15 used herein, a "polypeptide" refers to a molecule having the translated amino acid 
sequence generated from the polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ID NO:X was 
often generated by overlapping sequences contained in multiple clones (contig analysis). 
A representative clone containing all or most of the sequence for SEQ ID NO:X was 

20 deposited with the American Type Culture Collection ("ATCC"). As shown in Table 
XIII, each clone is identified by a cDNA Clone ID (Identifier) and the ATCC Deposit 
Number. The ATCC is located at 10801 University Boulevard, Manassas, Virginia 
201 10-2209, USA. The ATCC deposit was made pursuant to the terms of the Budapest 
Treaty on the international recognition of the deposit of microorganisms for purposes of 

25 patent procedure. 

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained 
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in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited with 
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at 42 
degree C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% 
5 dextran sulfate, and 20 jig/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0.1 x SSC at about 65 degree C. 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
10 accomplished through the manipulation of formamide concentration (lower percentages 
of formamide result in lowered stringency); salt conditions, or temperature. For 
example, lower stringency conditions include an overnight incubation at 37 degree C in a 
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH 2 P0 4 ; 0.02M EDTA, 
pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; followed 
15 by washes at 50 degree C with 1XSSPE, 0.1% SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher 
salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background 
20 in hybridization experiments. Typical blocking reagents include Denhardt's reagent, 
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available 
proprietary formulations. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

25 Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as 

any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
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complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone generated using oligo dT as a primer). 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA 
or modified RNA or DNA. For example, polynucleotides can be composed of single- 
and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, 
single- and double-stranded RNA, and RNA that is mixture of single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
stranded or, more typically, double-stranded or a mixture of single- and double-stranded 
regions. In addition, the polynucleotide can be composed of triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. A polynucleotide may also contain 
one or more modified bases or DNA or RNA backbones modified for stability or for 
other reasons. "Modified" bases include, for example, tritylated bases and unusual bases 
such as inosine. A variety of modifications can be made to DNA and RNA; thus, 
"polynucleotide" embraces chemically, enzymatically, or metabolically modified forms. 

The polypeptide of the present invention can be composed of amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may 
be modified by either natural processes, such as posttranslational processing, or by 
chemical modification techniques which are well known in the art. Such modifications 
are well described in basic texts and in more detailed monographs, as well as in a 
voluminous research literature. Modifications can occur anywhere in a polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. It will be appreciated that the same type of modification may be present in the 
same or varying degrees at several sites in a given polypeptide. Also, a given 
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polypeptide may contain many types of modifications. Polypeptides may be branched , 
for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic polypeptides may result from 
posttranslation natural processes or may be made by synthetic methods. Modifications 
include acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of 
flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, 
demethylation, formation of covalent cross-links, formation of cysteine, formation of 
pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer- 
RNA mediated addition of amino acids to proteins such as arginylation, and 
ubiquitination. (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York 
(1993); POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. 
C. Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth 
Enzymol 182:626-646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X M refers to a polynucleotide sequence while "SEQ ID NO:Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table XIII. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it need 
not be identical to that of the polypeptide, but rather substantially similar to the dose- 
dependence in a given activity as compared to the polypeptide of the present invention 
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(i.e., the candidate polypeptide will exhibit greater activity or not more than about 25- 
fold less and, preferably, not more than about tenfold less activity, and most preferably, 
not more than about three-fold less activity relative to the polypeptide of the present 
invention.) 



Polynucleo tides and Polypeptides of the Invention 



FEATURES OF PROTEIN ENCODED BY GENE NO: 1 



The translation product of this gene shares sequence homology with a protein 
from Xenopus laevis that is described as upregulated in response to thyroid hormone in 
tadpoles, and is thought to be important in the tail resorption process during Xenopus 
laevis metamorphosis (See Proc. Natl. Acad. Sci. USA (1996 Mar. 5):93(5): 1924-9, 
which is herein incorporated by reference). In addition, translation product of this gene 
shares sequence homology with a recently described group of proteins, called hedgehog 
interacting proteins (HIPs) (See International Publication No. W098/12326, which is 
herein incorporated by reference). These proteins bind to hedgehog polypeptides such as 
Shh and Dhh with high affinity (Kd approx. 1 nM). HIPs exhibit spatiallyand temporally 
restricted expression domains indicative of important roles in hedgehog-mediated 
induction. They regulate differentiation of neuronal cells, regulate survival of 
differentiated neuronal cells, proliferation of chondrocytes, proliferation of testicular 
germ line cells and/or expression of patched or hedgehog genes. The biological activity 
of this polypeptide is assayed by techniques known in the art, otherwise disclosed herein 
and as described in International Publication No, W098/12326, which is herein 
incorporated by reference. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 
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MLRTSTPNLCGGLHCRAPWLSSGILCLCLIFLLGQVGLLQGHPQCLDYGPPFQPP 

LHLEFCSDYESFGCCDQHKDRRIAARYWDIMEYFDLKRHELCGDYIKDILCQEC 

SPYAAHLYDAENTQTPLRNLPGLCSDYCSAFHSNCHSAISLLTNDRGLQESHGRD 

GTRFCHLLDLPDKDYCFPNVLRNDYLNRHLGMVAQDPQGCLQLCLSEVANGLR 

NPVSMVHAGDGTHRFFVAEQVGVVWVYLPDGSRLEQPFLDLKNIVLTTPWIGD 

ERGFLGLAFHPKFRHNRKFYIYYSCLDKKKVEKIRISEMKVSRADPNKADLKSER 

VILEIEEPASNHNGGQLLFGLDGYMYIFTGDGGQAGDPFGLFGNAQNKSSLLGK 

VLRIDVNRAGSHGKRYRVPSDNPFVSEPGAHPAIYAYGIRNMWRCAVDRGDPIT 

RQGRGRIFCGDVGQNRFEEVDLILKGGNYGWRAKEGFACYDKKLCHNASLDDV 

LPIYAYGHAVGKSVTGGYVYRGCESPNLNGLYIFGDFMSGRLMALQEDRKNKK 

WKKQDLCLGSTTSCAFPGLISTHSKFI1SFAEDEAGELYFLATSYPSAYAPRGSIYK 

FVDPSRRAPPGKCKYKPVPVRTKSKRIPFRPLAKTVLDLLKEQSEKAARKSSSAT 

LASGPAQGLSEKGSSKKLASPTSSKNTLRGPGTKKKARVGPHVRQGKRRKSLKS 

HSGRMRPSAEQKRAGRSLP (SEQ ID NO: 47). Also preferred are polypeptides 

comprising the mature polypeptide which is predicted to consist of residues 42-724 of the 

foregoing sequence, and biologically active fragments of the mature polypeptide. 

Figures 1 A-C show the nucleotide (SEQ ID NO: 1 1) and deduced amino acid 
sequence (SEQ ID NO:29) of this protein. 

Figure 2 shows the regions of similarity between the amino acid sequences of 
SEQ ID NO:29, the Xenopus laevis tail resorption protein (gi|1234787) (SEQ ID 
NO:48), and the Hedgehog Interacting Protein ("HIP"; gi|AAD31 172.1) (SEQ ID 
NO:49). 

Figure 3 shows an analysis of the amino acid sequence of SEQ ID NO: 29. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that a 2.5-3.0 kb transcript of this gene is expressed 
primarily in testes tissue and A549 lung carcinoma tissue, but interestingly is absent from 
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normal lung tissue. This gene is also expressed in osteoarthritis tissue and human fetal 
tissues. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
5 Figures 1 A-C (SEQ ID NO:29), which was determined by sequencing a cloned cDNA. 
The nucleotide sequence shown in Figures 1A-C (SEQ ID NO: 11) was obtained by 
sequencing a cloned cDNA, which was deposited on Nov. 17, 1998 at the American 
Type Culture Collection, and given Accession Number 203484. The deposited gene is 
inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the Sall/NotI 

10 restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:l 1 is intended DNA fragments at least about 15nt, and more preferably at least about 

15 20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 1 1. By a fragment at least 20 nt in 

20 length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 11. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 

25 for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
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about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 55 1 to about 
570 of SEQ ID NO: 1 1 , or the complementary strand thereto, or the cDNA contained in 
the deposited gene. In this context "about" includes the particularly recited ranges, larger 
or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In 
additional embodiments, the polynucleotides of the invention encode functional 
attributes of the corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 3 and/or Table I, as 
described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table I can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 3, but may, as 
shown in Table I, be represented or identified by using tabular representations of the data 
presented in Figure 3. The DNA*STAR computer algorithm used to generate Figure 3 
(set on the original default parameters) was used to present the data in Figure 3 in a 
tabular format (See Table I). The tabular format of the data in Figure 3 is used to easily 
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determine specific boundaries of a preferred region. The above-mentioned preferred 
regions set out in Figure 3 and in Table I include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures 
1 A-C (SEQ ID NO:29). As set out in Figure 3 and in Table I, such preferred regions 
5 include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 

10 N-terminus of a protein results in modification of loss of one or more biological 

functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 

15 complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 

20 activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 1 A-C, up to the alanine residue at position number 524 and polynucleotides 

25 encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n 1-524 of Figures 1 A-C, where nl is an 
integer from 1 to 524 corresponding to the position of the amino acid residue in Figures 
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1 A-C (which is identical to the sequence shown as SEQ ID NO:29). N-terminal deletions 
of the polypeptide of the invention shown as SEQ ID NO:29 include polypeptides 
comprising the amino acid sequence of residues: V-2 to P-529; A-3 to P-529; Q-4 to P- 
529; D-5 to P-529; P-6 to P-529; Q-7 to P-529; G-8 to P-529; C-9 to P-529; L-10 to P- 
5 529; Q-l 1 to P-529; L-12 to P-529; C-13 to P-529; L-14 to P-529; S-15 to P-529; E-16 to 
P-529; V-17 to P-529; A-18 to P-529; N-19 to P-529; G-20 to P-529; L-21 to P-529; R- 
22 to P-529: N-23 to P-529; P-24 to P-529; V-25 to P-529; S-26 to P-529; M-27 to P- 
529; V-28 to P-529; H-29 to P-529; A-30 to P-529; G-31 to P-529; D-32 to P-529; G-33 
to P-529; T-34 to P-529; H-35 to P-529: R-36 to P-529; F-37 to P-529; F-38 to P-529; V- 

10 39 to P-529; A-40 to P-529; E-41 to P-529; Q-42 to P-529; V-43 to P-529; G-44 to P- 
529; V-45 to P-529; V-46 to P-529; W-47 to P-529; V-48 to P-529; Y-49 to P-529; L-50 
to P-529; P-51 to P-529; D-52 to P-529; G-53 to P-529; S-54 to P-529; R-55 to P-529; L- 
56 to P-529; E-57 to P-529; Q-58 to P-529; P-59 to P-529; F-60 to P-529; L-61 to P-529; 
D-62 to P-529; L-63 to P-529; K-64 to P-529; N-65 to P-529; 1-66 to P-529; V-67 to P- 

15 529; L-68 to P-529; T-69 to P-529; T-70 to P-529; P-71 to P-529; W-72 to P-529; 1-73 to 
P-529; G-74 to P-529; D-75 to P-529; E-76 to P-529; R-77 to P-529; G-78 to P-529; F- 
79 to P-529; L-80 to P-529; G-81 to P-529; L-82 to P-529; A-83 to P-529; F-84 to P-529; 
H-85 to P-529; P-86 to P-529; K-87 to P-529; F-88 to P-529; R-89 to P-529; H-90 to P- 
529; N-91 to P-529; R-92 to P-529; K-93 to P-529; F-94 to P-529; Y-95 to P-529; 1-96 to 

20 P-529; Y-97 to P-529; Y-98 to P-529; S-99 to P-529; C-100 to P-529; L-101 to P-529; 
D-102 to P-529; K-103 to P-529; K-104 to P-529; K-105 to P-529; V-106 to P-529; E- 
107 to P-529; K-108 to P-529; 1-109 to P-529; R-l 10 to P-529; 1-1 1 1 to P-529; S-l 12 to 
P-529; E-l 13 to P-529; M-l 14 to P-529; K-l 15 to P-529; V-l 16 to P-529; S-l 17 to P- 
529; R-l 18 to P-529; A-l 19 to P-529; D-120 to P-529; P-121 to P-529; N-122 to P-529; 

25 K-123 to P-529; A-124 to P-529; D-125 to P-529; L-126 to P-529; K-127 to P-529; S- 
128 to P-529; E-129 to P-529; R-130 to P-529; V-131 to P-529; 1-132 to P-529; L-133 to 
P-529; E-134 to P-529; 1-135 to P-529; E-136 to P-529; E-137 to P-529; P-138 to P-529; 
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A-139 to P-529; S-140 to P-529; N-141 to P-529; H-142 to P-529; N-143 to P-529; G- 
144 to P-529; G-145 to P-529; Q-146 to P-529; L-147 to P-529; L-148 to P-529; F-149 
to P-529; G-150 to P-529; L-151 to P-529; D-152 to P-529; G-153 to P-529; Y-154 to P- 
529; M-155 to P-529; Y-156 to P-529; 1-157 to P-529; F-158 to P-529; T-159 to P-529; 
5 G-160 to P-529; D-161 to P-529; G-162 to P-529; G-163 to P-529; Q-164 to P-529; A- 
165 to P-529; G-166 to P-529; D-167 to P-529; P-168 to P-529; F-169 to P-529; G-170 
to P-529; L-171 to P-529; F-172 to P-529; G-173 to P-529; N-174 to P-529; A-175 to P- 
529; Q-176 to P-529; N-177 to P-529; K-178 to P-529; S-179 to P-529; S-180 to P-529; 
L-181 to P-529; L-182 to P-529; G-183 to P-529; K-184 to P-529; V-185 to P-529; L- 

10 186 to P-529; R-187 to P-529; 1-188 to P-529; D-i 89 to P-529; V-190 to P-529; N-191 to 
P-529; R-192 to P-529; A-193 to P-529; G-194 to P-529; S-195 to P-529; H-196 to P- 
529; G-197 to P-529; K-198 to P-529; R-199 to P-529; Y-200 to P-529; R-201 to P-529; 
V-202 to P-529; P-203 to P-529; S-204 to P-529; D-205 to P-529; N-206 to P-529; P-207 
to P-529; F-208 to P-529; V-209 to P-529; S-210 to P-529; E-21 1 to P-529; P-212 to P- 

15 529; G-213 to P-529; A-214 to P-529; H-215 to P-529; P-216 to P-529; A-217 to P-529; 
1-218 to P-529; Y-219 to P-529; A-220 to P-529; Y-221 to P-529; G-222 to P-529; 1-223 
to P-529; R-224 to P-529; N-225 to P-529; M-226 to P-529; W-227 to P-529; R-228 to 
P-529; C-229 to P-529; A-230 to P-529; V-231 to P-529; D-232 to P-529; R-233 to P- 
529; G-234 to P-529; D-235 to P-529; P-236 to P-529; 1-237 to P-529; T-238 to P-529; 

20 R-239 to P-529; Q-240 to P-529; G-241 to P-529; R-242 to P-529; G-243 to P-529; R- 
244 to P-529; 1-245 to P-529; F-246 to P-529; C-247 to P-529; G-248 to P-529; D-249 to 
P-529; V-250 to P-529; G-25 1 to P-529; Q-252 to P-529; N-253 to P-529; R-254 to P- 
529; F-255 to P-529; E-256 to P-529; E-257 to P-529; V-258 to P-529; D-259 to P-529; 
L-260 to P-529; 1-261 to P-529; L-262 to P-529; K-263 to P-529; G-264 to P-529; G-265 

25 to P-529; N-266 to P-529; Y-267 to P-529; G-268 to P-529; W-269 to P-529; R-270 to P- 
529; A-271 to P-529; K-272 to P-529; E-273 to P-529; G-274 to P-529; F-275 to P-529; 
A-276 to P-529; C-277 to P-529; Y-278 to P-529; D-279 to P-529; K-280 to P-529; K- 
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281 to P-529; L-282 to P-529; C-283 to P-529; H-284 to P-529; N-285 to P-529; A-286 
to P-529; S-287 to P-529; L-288 to P-529; D-289 to P-529; D-290 to P-529; V-291 to P- 
529; L-292 to P-529; P-293 to P-529; 1-294 to P-529; Y-295 to P-529; A-296 to P-529; 
Y-297 to P-529; G-298 to P-529; H-299 to P-529; A-300 to P-529; V-301 to P-529; G- 
302 to P-529; K-303 to P-529; S-304 to P-529; V-305 to P-529; T-306 to P-529; G-307 
to P-529; G-308 to P-529; Y-309 to P-529; V-310 to P-529; Y-31 1 to P-529; R-312 to P- 
529; G-313 to P-529; C-314 to P-529; E-315 to P-529; S-316 to P-529; P-317 to P-529; 
N-318 to P-529; L-319 to P-529; N-320 to P-529; G-321 to P-529; L-322 to P-529; Y- 
323 to P-529; 1-324 to P-529; F-325 to P-529; G-326 to P-529; D-327 to P-529; F-328 to 
P-529; M-329 to P-529; S-330 to P-529; G-331 to P-529; R-332 to P-529; L-333 to P- 
529; M-334 to P-529; A-335 to P-529; L-336 to P-529; Q-337 to P-529; E-338 to P-529; 
D-339 to P-529; R-340 to P-529; K-341 to P-529; N-342 to P-529; K-343 to P-529; K- 
344 to P-529; W-345 to P-529; K-346 to P-529; K-347 to P-529; Q-348 to P-529; D-349 
to P-529; L-350 to P-529; C-351 to P-529; L-352 to P-529; G-353 to P-529; S-354 to P- 
529; T-355 to P-529; T-356 to P-529; S-357 to P-529; C-358 to P-529; A-359 to P-529; 
F-360 to P-529; P-361 to P-529; G-362 to P-529; L-363 to P-529; 1-364 to P-529; S-365 
to P-529; T-366 to P-529; H-367 to P-529; S-368 to P-529; K-369 to P-529; F-370 to P- 
529; 1-371 to P-529; 1-372 to P-529; S-373 to P-529; F-374 to P-529; A-375 to P-529; E- 
376 to P-529; D-377 to P-529; E-378 to P-529; A-379 to P-529; G-380 to P-529; E-381 
to P-529; L-382 to P-529; Y-383 to P-529; F-384 to P-529; L-385 to P-529; A-386 to P- 
529; T-387 to P-529; S-388 to P-529; Y-389 to P-529; P-390 to P-529; S-391 to P-529; 
A-392 to P-529; Y-393 to P-529; A-394 to P-529; P-395 to P-529; R-396 to P-529; G- 
397 to P-529; S-398 to P-529; 1-399 to P-529; Y-400 to P-529; K-401 to P-529; F-402 to 
P-529; V-403 to P-529; D-404 to P-529; P-405 to P-529; S-406 to P-529; R-407 to P- 
529; R-408 to P-529; A-409 to P-529; P-410 to P-529; P-411 to P-529; G-412 to P-529; 
K-413 to P-529; C-414 to P-529; K-415 to P-529; Y-416 to P-529; K-417 to P-529; P- 
418 to P-529; V-419 to P-529; P-420 to P-529; V-421 to P-529; R-422 to P-529; T-423 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

16 

to P-529; K-424 to P-529; S-425 to P-529; K-426 to P-529; R-427 to P-529; 1-428 to P- 
529; P-429 to P-529; F-430 to P-529; R-431 to P-529; P-432 to P-529; L-433 to P-529; 
A-434 to P-529; K-435 to P-529; T-436 to P-529; V-437 to P-529; L-438 to P-529; D- 
439 to P-529; L-440 to P-529; L-441 to P-529; K-442 to P-529; E-443 to P-529; Q-444 
5 to P-529; S-445 to P-529; E-446 to P-529; K-447 to P-529; A-448 to P-529; A-449 to P- 
529; R-450 to P-529; K-451 to P-529; S-452 to P-529; S-453 to P-529; S-454 to P-529; 
A-455 to P-529; T-456 to P-529; L-457 to P-529; A-458 to P-529; S-459 to P-529; G- 
460 to P-529; P-461 to P-529; A-462 to P-529; Q-463 to P-529; G-464 to P-529; L-465 
to P-529; S-466 to P-529; E-467 to P-529; K-468 to P-529; G-469 to P-529; S-470 to P- 
10 529; S-471 to P-529; K-472 to P-529; K-473 to P-529; L-474 to P-529; A-475 to P-529; 
S-476 to P-529; P-477 to P-529; T-478 to P-529; S-479 to P-529: S-480 to P-529; K-481 
to P-529; N-482 to P-529; T-483 to P-529; L-484 to P-529; R-485 to P-529; G-486 to P- 
529; P-487 to P-529; G-488 to P-529; T-489 to P-529; K-490 to P-529; K-491 to P-529; 
K-492 to P-529; A-493 to P-529; R-494 to P-529; V-495 to P-529; G-496 to P-529; P- 
15 497 to P-529; H-498 to P-529; V-499 to P-529; R-500 to P-529; Q-501 to P-529; G-502 
to P-529; K-503 to P-529; R-504 to P-529; R-505 to P-529; K-506 to P-529; S-507 to P- 
529; L-508 to P-529; K-509 to P-529; S-510 to P-529; H-51 1 to P-529; S-512 to P-529; 
G-513 to P-529; R-514 to P-529; M-515 to P-529; R-516 to P-529; P-517 to P-529; S- 
518 to P-529; A-519 to P-529; E-520 to P-529; Q-521 to P-529; K-522 to P-529; R-523 
20 to P-529; A-524 to P-529; of SEQ ID NO:29. Polypeptides encoded by these 
polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability 
25 to illicit mitogenic activity, induce differentiation of normal or malignant cells, bind to 
EGF receptors, etc.)), may still be retained. For example the ability to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

17 

will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 1 A-C up to the glutamine residue at position number 7, 
and polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-ml of Figures 
1 A-C, where ml is an integer from 7 to 528 corresponding to the position of the amino 
acid residue in Figures 1 A-C. Moreover, the invention provides polynucleotides 
encoding polypeptides comprising, or alternatively consisting of, the amino acid 
sequence of C-terminal deletions of the polypeptide of the invention shown as SEQ ID 
NO:29 include polypeptides comprising the amino acid sequence of residues: M-l to L- 
528; M-l to S-527; M-l to R-526; M-l to G-525; M-l to A-524; M-l to R-523; M-l to 
K-522;M-1 toQ-521;M-l toE-520; M-l to A-519;M-1 toS-518;M-l toP-517;M-l to 
R-516; M-l to M-515; M-l to R-514; M-l to G-513; M-l to S-512; M-l to H-511; M-l 
to S-510; M-l to K-509; M-l to L-508; M-l to S-507; M-l to K-506; M-l to R-505; M-l 
to R-504; M-l to K-503; M-l to G-502; M-l to Q-501; M-l to R-500; M-l to V-499; M- 
1 to H-498; M-l to P-497; M-l to G-496; M-l to V-495; M-l to R-494; M-l to A-493; 
M-l to K-492; M-l to K-491; M-l to K-490; M-l to T-489; M-l to G-488; M-l to P- 
487; M-l to G-486; M-l to R-485; M-l to L-484; M-l to T-483; M-l to N-482; M-l to 
K-481; M-l to S-480; M-l to S-479; M-l to T-478; M-l to P-477; M-l to S-476; M-l to 
A-475; M-l to L-474; M-l to K-473; M-l to K-472; M-l to S-471; M-l to S-470; M-l to 
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G-469; M-l to K-468; M-l to E-467; M-l to S-466; M-l to L-465; M-l to G-464; M-l 
to Q-463; M-l to A-462; M-l to P-461; M-l to G-460; M-l to S-459; M-l to A-458; M- 
1 to L-457; M-l to T-456; M-l to A-455; M-l to S-454; M-l to S-453; M-l to S-452; M- 
1 to K-451; M-l to R-450; M-l to A-449; M-l to A-448; M-l to K-447; M-l to E-446; 
5 M-l to S-445; M-l to Q-444; M-l to E-443; M- 1 to K-442; M-l to L-441; M-l to L-440; 
M-l to D-439; M-l to L-438; M-l to V-437; M-l to T-436; M-l to K-435; M-l to A- 
434; M-l to L-433; M-l to P-432; M-l to R-431; M-l to F-430; M-l to P-429; M-l to I- 
428; M-l to R-427; M-l to K-426; M-l to S-425; M-l to K-424; M-l to T-423; M-l to 
R-422; M-l to V-421; M-l to P-420; M-l to V-419; M-l to P-418; M-l to K-417; M-l 

10 to Y-416;M-1 toK-415;M-l toC-414;M-l toK-413;M-l toG-412;M-l toP-411;M- 
1 to P-410; M-l to A-409: M-l to R-408; M-l to R-407; M-l to S-406; M-l to P-405; 
M-l to D-404; M-l to V-403; M-l to F-402; M-l to K-401; M-l to Y-400; M-l to 1-399; 
M-l to S-398; M-l to G-397; M-l to R-396; M-l to P-395; M-l to A-394; M-l to Y- 
393; M-l to A-392; M-l to S-391; M-l to P-390; M-l to Y-389; M-l to S-388; M-l to 

15 T-387; M-l to A-386; M-l to L-385; M-l to F-384; M-l to Y-383; M-l to L-382; M-l to 
E-381; M-l to G-380; M-l to A-379; M-l to E-378; M-l to D-377; M-l to E-376; M-l 
to A-375; M-l to F-374; M-l to S-373; M-l to 1-372; M-l to 1-371; M-l to F-370; M-l 
to K-369; M-l to S-368; M-l to H-367; M-l to T-366; M-l to S-365; M-l to 1-364; M-l 
to L-363; M-l to G-362; M-l to P-361; M-l to F-360; M-l to A-359; M-l to C-358; M-l 

20 to S-357; M-l to T-356; M-l to T-355; M-l to S-354; M-l to G-353; M-l to L-352; M-l 
to C-351; M-l to L-350; M-l to D-349; M-l to Q-348; M-l to K-347; M-l to K-346; M- 
1 to W-345; M-l to K-344; M-l to K-343; M-l to N-342; M-l to K-341; M-l to R-340; 
M-l to D-339; M-l to E-338; M-l to Q-337; M-l to L-336; M-l to A-335; M-l to M- 
334; M-l to L-333; M-l to R-332; M-l to G-331; M-l to S-330; M-l to M-329; M-l to 

25 F-328; M-l to D-327; M-l to G-326; M-l to F-325; M-l to 1-324; M-l to Y-323; M-l to 
L-322; M-l to G-321; M-l to N-320; M-l to L-319; M-l to N-318; M-l to P-317; M-l 
toS-316; M-l toE-315;M-l toC-314;M-l toG-313;M-l toR-312; M-l toY-311;M- 
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1 to V-310; M-l to Y-309; M-l to G-308; M-l to G-307; M-l to T-306; M-l to V-305; 
M-l to S-304; M-l to K-303; M-l to G-302; M-l to V-301; M-l to A-300; M-l to H- 
299; M-l to G-298; M-l to Y-297; M-l to A-296; M-l to Y-295; M-l to 1-294; M-l to 
P-293; M-l to L-292; M-l to V-291; M-l to D-290; M-l to D-289; M-l to L-288; M-l 
5 to S-287; M-l to A-286; M-l to N-285; M-l to H-284; M-l to C-283; M-l to L-282; M- 
1 to K-281; M-l to K-280; M-l to D-279; M-l to Y-278; M-l to C-277; M-l to A-276; 
M-l to F-275; M-l to G-274; M-l to E-273; M-l to K-272; M-l to A-271; M-l to R- 
270; M-l to W-269; M-l to G-268; M-l to Y-267; M-l to N-266; M-l to G-265; M-l to 
G-264; M-l to K-263; M-l to L-262; M-l to 1-261; M-l to L-260; M-l to D-259; M-l to 
10 V-258; M-l to E-257; M-l to E-256; M-l lo F-255; M-l to R-254; M-l to N-253; M-l to 
Q-252; M-l to G-251; M-l to V-250; M-l to D-249; M-l to G-248; M-l to C-247; M-l 
to F-246; M-l to 1-245; M-l to R-244; M-l to G-243; M-l to R-242; M-l to G-241; M-l 
to Q-240; M-l to R-239: M-l toT-238; M-l to 1-237; M-l to P-236; M-l toD-235; M-l 
to G-234; M-l to R-233; M-l to D-232; M-l to V-231; M-l to A-230; M-l to C-229; M- 
15 1 to R-228; M-l to W-227; M-l to M-226; M-l to N-225; M-l to R-224; M-l to 1-223; 
M-l to G-222; M-l to Y-221; M-l to A-220; M-l to Y-2I9; M-l to 1-218; M-l to A- 
217; M-l toP-216; M-l to H-215; M-l to A-214; M-l toG-213;M-l toP-212; M-l to 
E-21 1; M-l to S-210; M-l to V-209; M-l to F-208; M-l to P-207; M-l to N-206; M-l to 
D-205; M-l to S-204; M-l to P-203; M-l to V-202; M-l to R-201; M-l to Y-200; M-l 
20 to R-199; M-l to K-198; M-l to G-197; M-l to H-196; M-l to S-195; M-l to G-194; M- 
1 to A-193; M-l to R-192; M-l to N-191; M-l to V-190; M-l to D-189; M-l to 1-188; 
M-l to R-187; M-l to L-186; M-l to V-185; M-l to K-184; M-l to G-183; M-l to L- 
182; M-l to L-181; M-l to S-180; M-l to S-179; M-l to K-178; M-l to N-177; M-l to 
Q-176; M-l toA-175;M-l toN-174;M-l toG-173; M-l toF-172;M-l toL-171;M-l 
25 to G-170; M-l to F-169; M-l to P-168; M-l to D-167; M-l to G-166; M-l to A-165; M- 
1 toQ-164;M-l toG-163;M-l toG-162; M-l toD-161;M-l toG-160; M-l toT-159; 
M-l to F-158; M-l to 1-157; M-l to Y-156; M-l to M-155; M-l to Y-154; M-l to G- 
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153; M-l to D-152; M-l to L-I51; M-l to G-150; M-l to F-149; M-l toL-148; M-i to 
L-147;M-1 toQ-146; M-l to G-145;M-1 toG-144;M-l toN-143;M-l toH-142; M-l 
to N-141; M-l to S-140; M-l to A-139; M-l to P-138; M-l to E-137; M-l to E-136; M-l 
to 1-135; M-l to E-134; M-l to L-133; M-l to 1-132; M-l to V-131; M-l to R-130; M-l 
5 to E-129; M-l to S-128; M-l to K-127; M-l to L-126; M-l to D-125; M-l to A-124; M- 
1 toK-123;M-l toN-122: M-l toP-121;M-l toD-120; M-l toA-119;M-l to R-118; 
M-l to S-117; M-l to V-116; M.l toK-115;M-l to M-l 14; M-l to E-l 13; M-l toS- 
112; M-l to 1-111; M-l to R-110; M-l to 1-109; M-l to K-108; M-l to E-107; M-l to V- 
106; M-l to K-105; M-l to K-104; M-l to K-103; M-l to D-102; M-l to L-101; M-l to 

10 C-100; M-l to S-99; M-l to Y-98; M-l to Y-97: M-l to 1-96; M-l to Y-95; M-l to F-94; 
M-l to K-93; M-l to R-92; M-l to N-91; M-l to H-90; M-l to R-89; M-l to F-88; M-l 
to K-87; M-l to P-86; M-l to H-85; M-l to F-84; M-l to A-83; M-l to L-82; M-l to G- 
81; M-l to L-80; M-l to F-79; M-l to G-78; M-l to R-77; M-l to E-76; M-l to D-75; M- 
1 to G-74; M-l to 1-73; M-l to W-72; M-l to P-71; M-l to T-70; M-l to T-69; M-l to L- 

15 68; M-l to V-67; M-l to 1-66; M-l to N-65; M-l to K-64; M-l to L-63; M-l to D-62; M- 
1 to L-61; M-l to F-60; M-l to P-59; M-l to Q-58; M-l to E-57; M-l to L-56; M-l to R- 
55; M-l toS-54;M-l toG-53;M-l toD-52;M-l toP-51;M-l toL-50; M-l to Y-49; M- 
1 to V-48; M-l to W-47; M-l to V-46; M-l to V-45; M-l to G-44; M-l to V-43; M-l to 
Q-42; M-l to E-41; M-l to A-40; M-l to V-39; M-l to F-38; M-l to F-37: M-l to R-36; 

20 M-l to H-35; M-l to T-34; M-l to G-33; M-l to D-32; M-l to G-31; M-l to A-30; M-l 
to H-29; M-l to V-28; M-l to M-27; M-l to S-26; M-l to V-25; M-l to P-24; M-l to N- 
23; M-l to R-22; M-l to L-21; M-l to G-20; M-l to N-19; M-l to A-18; M-l to V-17; 
M-l to E-16; M-l to S-15; M-l to L-14; M-l to C-13; M-l to L-12; M-l to Q-ll; M-l to 
L-10; M-l to C-9; M-l to G-8; M-l to Q-7; of SEQ ID N0.29. Polypeptides encoded by 

25 these polynucleotides are also encompassed by the invention. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, developmental disorders, and degenerative disorders; osteoarthritis, and lung 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of developing 
tissues, cartilage, and bone, expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g. bone, lung, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 29 as residues: Asp-52 to Glu-57, Arg-89 to Tyr-95, Asp-102 to 
Glu-107, Ser-1 17 to Ser-128, Glu-137 to Gly-145, Arg-192 to Arg-199, Val-231 to Gly- 
243, Val-250 to Glu-256, Arg-312 to Asn-318, GIu-338 to Asp-349, Pro-405 to Lys-417, 
Thr-423 to IIe-428, Lys-442 to Ser-453, Glu-467 to Ala-475, Thr-478 to Arg-494, Pro- 
497 to Arg-526. Polynucleotides encoding said polypeptides are also provided. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 1 1 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 2595 of SEQ ID NO: 1 1, b is an integer of 15 to 
2609, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 1 1 , and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

The translation product of this gene, sometimes referred to herein as TIDE (for 
5 Ten Integrin Domains with EGF homology), shares sequence homology with integrins, 
which are a superfamily of dimeric ab cell-surface glycoproteins that mediate the 
adhesive functions of many cell types, enabling cells to interact with one another and 
with the extracellular matrix (See Genomics 56, 169-178 (1999); all information and 
references contained within this publication are hereby incorporated herein by reference). 

10 Eight human integrin b subunits have been described to date, and in combination with the 
12 known a subunits form a large family of heterodimeric cell surface receptors that 
mediate cell adhesion to counter-receptors on neighboring cells, and to ECM proteins 
(reviewed by Hynes, 1992). lntegrin-ligand interactions are crucial for fundamental 
biological processes such as cell migration and motility, and lymphocyte extravasation. 

15 In another embodiment, polypeptides comprising the amino acid sequence of the 

open reading frame upstream of the predicted signal peptide are contemplated by the 
present invention. Specifically, polypeptides of the invention comprise the following 
amino acid sequence: 

TSTPPRAVPLPKSSQAAHQRNCNSGWSPGPASLGVRGSVCPAICWWHLS 
20 LLPPPSVNPTLQKCSSPGAAQELSMRPPGFRNFLLLASSLLFAGLSAVPQSFSPSLR 
SWPGAACRLSRAESERRCRAPGQPPGAALCHGRGRCDCGVCICHVTEPGMFFGP 
LCECHEWVCETYDGSTCAGHGKCDCGKCKCDQGWYGDACQYPTNCDLTKKK 
SNQMCKNSQDIICSNAGTCHCGRCKCDNSDGSGLVYGKFCECDDRECIDDETEEI 
CGGHGKCYCGNCYCKAGWHGDKCEFQCDITPWESKRRCTSPDGKICSNRGTCV 
25 CGECTCHDVDPTGDWGDIHGDTCECDERDCRAVYDRYSDDFCSGHGQCNCGR 
CDCKAGWYGKKCEHPQSCTLSAEESIRKCQGSSDLPCSGRGKCECGKCTCYPPG 
DRRVYGKTCECDDRRCEDLDGVVCGGHGTCSCGRCVCERGWFGKLCQHPRKC 
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NMTEEQSKNLCESADGILCSGKGSCHCGKCICSAEEWYISGEFCDCDDRDCDKH 
DGLICTGNGICSCGNCECWDGWNGNACEI WLGSEYP (SEQ ID NO:50). 
Polynucleotides encoding these polypeptides are also provided. 

Included in this invention as preferred domains are EGF-like domain signature 1 
and 2 domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). A sequence of about thirty to forty amino-acid residues long found in 
the sequence of epidermal growth factor (EGF) has been shown [1 to 6] to be present, in 
a more or less conserved form, in a large number of other, mostly animal proteins. The 
functional significance of EGF domains in what appear to be unrelated proteins is not yet 
clear. However, a common feature is that these repeats are found in the extracellular 
domain of membrane-bound proteins or in proteins known to be secreted (exception: 
prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have 
been shown (in EGF) to be involved in disulfide bonds. The main structure is a two- 
stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. 
Subdomains between the conserved cysteines strongly vary in length as shown in the 
following schematic representation of the EGF-like domain: 

+ + + ___ + 

I I.I I 

Jc(4)-C-x(0,48)-C-x(3,12)-C-x(l,70)-C-x(1.6)-C-x(2)-O-a-x(0,2X)-G-x(2)-C-x 
I I *** 

+ + 

'C: conserved cysteine involved in a disulfide bond. 'G': often conserved glycine 'a*: 
often conserved aromatic amino acid '**: position of both patterns. Y: any residue The 
region between the 5th and 6th cysteine contains two conserved glycines of which at 
least one is present in most EGF-like domains. The concensus pattern is as follows: C-x- 
C-x(5)-G-x(2)-C [The 3 C's are involved in disulfide bonds]. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GKCDCGKCKCDQGWYGDACQYPTNCDLTK (SEQ ID NO: 51), 
SUBSTITUTE SHEET (RULE 26) 
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GGHGKCYCGNCYCKAGWHGDKCEFQCDIT (SEQ ID NO:52), 
HGQCNCGRCDCKAGWYGKKCEHPQSCTLS (SEQ ID NO: 53), 
HGTCSCGRCVCERGWFGKLCQHPRKCNMT (SEQ ID NO: 54), 
GNGICSCGNCECWDGWNGNACEIWLGSEY (SEQ ID NO: 55), and 
5 ICGGHGKCYCGNCYCKAGWHGDKCEFQCDITPWESK (SEQ ID NO: 73). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the EGF-like domain signature 1 
and 2 domains of the sequence referenced in Table I for this gene, and at least 5, 10, 15, 
20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 

10 sequence. The additional contiguous amino acid residues is N-terminal or C- terminal to 
the EGF-like domain signature 1 and 2 domains. 

Alternatively, the additional contiguous amino acid residues is both N-terminal 
and C-terminal to the EGF-like domain signature 1 and 2 domains, wherein the total N- 
and C-terminal contiguous amino acid residues equal the specified number. The above 

15 preferred polypeptide domain is characteristic of a signature specific to EGF-like domain 
1 and 2 containing proteins. Based on the sequence similarity, the translation product of 
this gene is expected to share at least some biological activities with EGF-like containing 
proteins. Such activities are known in the art, some of which are described elsewhere 
herein. 

20 Included in this invention as preferred domains are integrins beta chain cysteine- 

rich domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). Integrins [7,8] are a large family of cell surface receptors that mediate 
cell to cell as well as cell to matrix adhesion. Some integrins recognize the R-G-D 
sequence in their extracellular matrix protein ligand. Structurally, integrins consist of a 

25 dimer of an alpha and a beta chain. Each subunit has a large N-terminal extracellular 

domain followed by a transmembrane domain and a short C-terminal cytoplasmic region. 
Some receptors share a common beta chain while having different alpha chains. All the 
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integrin beta chains contain four repeats of a forty amino acid region in the C-terminal 
extremity of their extracellular domain. Each of the repeats contains eight cysteines. The 
concensus pattern is as follows: C-x-[GNQ]-x(l,3)-G-x-C-x-C-x(2)-C-x-C [The five C's 
are probably involved in disulfide bonds]. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GQPPGAALCHGRGRCDCGVCICHVTEPGMFFGPLC (SEQ ID NO: 74), 
ETYDGSTCAGHGKCDCGKCKCDQGWYGDACQYP (SEQ ID NO:58), 
MCKNSQDIICSNAGTCHCGRCKCDNSDGSGLVYG (SEQ ID NO:59), 
IDDETEEICGGHGKCYCGNCYCKAGWHGDKC (SEQ ID NO:60), 
KRRCTSPDGKICSNRGTCVCGECTCHDVDPTGDW (SEQ ID NO:61), 
DRYSDDFCSGHGQCNCGRCDCKAGWYGKKCEHPQ (SEQ ID NO:62), 
CQGSSDLPCSGRGKCECGKCTCYPPGDRRVYGK (SEQ ID NO:63), 
CEDLDGVVCGGHGTCSCGRCVCERGWFGKLC (SEQ ID NO:64), 
SADGILCSGKGSCHCGKCICSAEEWYISGEFC (SEQ ID NO:65), and 
CDKHDGLICTGNGICSCGNCECWDGWNGNACEI (SEQ ID NO: 66). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the integrins beta chain cysteine- 
rich domain of the sequence referenced in Table XIII for this gene, and at least 5, 10, 1 5, 
20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 
sequence. The additional contiguous amino acid residues is N-terminal or C- terminal to 
the integrins beta chain cysteine-rich domain. 

Alternatively, the additional contiguous amino acid residues is both N-terminal 
and C-terminal to the integrins beta chain cysteine-rich domain, wherein the total N- and 
C-terminal contiguous amino acid residues equal the specified number. The above 
preferred polypeptide domain is characteristic of a signature specific to integrin proteins. 
Based on the sequence similarity, the translation product of this gene is expected to share 
at least some biological activities with integrin proteins, and specifically those containing 
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an integrins beta chain cysteine-rich domain. Such activities are known in the art, some 
of which are described elsewhere herein. The following publications were referenced 
above and are hereby incorporated herein by reference: [ 1] Davis C.G., New Biol. 
2:410-419(1990); [ 2] Blomquist M.C, Hunt L.T., Barker W.C., Proc. Natl. Acad. Sci. 
5 U.S.A. 81:7363-7367(1984); [ 3] Barker W.C., Johnson G.C., HuntL.T., George D.G., 
Protein Nucl. Acid Enz. 29:54-68(1986); [ 4] Doolittle R.F., Feng D.F., Johnson M.S., 
Nature 307:558-560(1984); [ 5] AppellaR, Weber I.T., Blasi F., FEBS Lett. 231:1- 
4(1988); [ 6] Campbell I.D., Bork P., Curr. Opin. Struct. Biol. 3:385-392(1993); [ 7] 
Hynes R.O., Cell 48:549-554(1987); and [ 8] Albelda S.M., Buck C.A., FASEB J. 

10 4:2868-2880(1990). 

The polypeptide of the present invention has been putatively identified as a 
member of the integrin family and has been termed Ten Integrin Domains with EGF 
homology ("TIDE"). This identification has been made as a result of amino acid 
sequence homology to the human integrin beta-8 subunit (See Genbank Accession No. 

15 gi|J84521). 

Figures 4A-C shows the nucleotide (SEQ ID NO: 12) and deduced amino acid 
sequence (SEQ ID NO:30) of TIDE. Predicted amino acids from about 1 to about 23 
constitute the predicted signal peptide (amino acid residues from about 1 to about 23 in 
SEQ ID NO:30) and are represented by the underlined amino acid regions; amino acids 

20 from about 108 to about 136, from about 195 to about 223, from about 291 to about 319, 
from about 379 to about 407, and/or from about 465 to about 493 constitute the predicted 
EGF-like domain signature 1 and 2 domains (amino acids from about 108 to about 136, 
from about 195 to about 223, from about 291 to about 319, from about 379 to about 407, 
and/or from about 465 to about 493 in SEQ ID NO:30) and are represented by the double 

25 underlined amino acids; and amino acids from about 55 to about 89, from about 97 to 
about 129, from about 142 to about 175, from about 186 to about 216, from about 228 to 
about 261, from about 281 to about 314, from about 327 to about 359, from about 368 to 
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about 398, from about 417 to about 448, and/or from about 455 to about 487 constitute 
the predicted integrins beta chain cysteine-rich domains (amino acids from about 55 to 
about 89, from about 97 to about 129, from about 142 to about 175, from about 186 to 
about 216, from about 228 to about 261, from about 281 to about 314, from about 327 to 
about 359, from about 368 to about 398, from about 417 to about 448, and/or from about 
455 to about 487 in SEQ ID NO:30) and are represented by the shaded amino acids. 

Figure 5 shows the regions of similarity between the amino acid sequences of the 
Ten Integrin Domains with EGF homology (TIDE) protein (SEQ ID NO:30) and the 
human integrin beta-8 subunit (SEQ ID NO: 67). 

Figure 6 shows an analysis of the Ten Integrin Domains with EGF homology 
(TIDE) amino acid sequence. Alpha, beta, turn and coil regions; hydrophilicity and 
hydrophobicity; amphipathic regions; flexible regions; antigenic index and surface 
probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human osteoblasts, synovial hypoxia tissue, osteoblast and osteoclast, bone marrow 
stromal cells, umbilical vein, smooth muscle, placenta, and fetal lung. The 
polynucleotide of this invention was discovered in a human osteoblast II cDNA library. 
Its translation product has homology to the characteristic integrins beta chain cysteine- 
rich domains of integrin family members. The polynucleotide contains an open reading 
frame encoding the TIDE polypeptide of 494 amino acids. TIDE exhibits a high degree 
of homology at the amino acid level to the human integrin beta-8 subunit (as shown in 
Figure 5). 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the TIDE polypeptide having the amino acid sequence shown in 
Figures 4A-C (SEQ ID NO:30). The nucleotide sequence shown in Figures 4A-C (SEQ 
ID NO: 12) was obtained by sequencing a cloned cDNA (HOHCH55), which was 
deposited on November 17 at the American Type Culture Collection, and given 
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Accession Number 203484. The present invention is further directed to fragments of the 
isolated nucleic acid molecules described herein. By a fragment of an isolated DNA 
molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in SEQ ID NO: 12 is intended DNA fragments at least about 15nt, and 
5 more preferably at least about 20 nt, still more preferably at least about 30 nt, and even 
more preferably, at least about 40 nt in length which are useful as diagnostic probes and 
primers as discussed herein. Of course, larger fragments 50-1500 nt in length are also 
useful according to the present invention, as are fragments corresponding to most, if not 
all, of the nucleotide sequence of the deposited cDNA or as shown in SEQ ID NO: 12. By 

10 a fragment at least 20 nt in length, for example, is intended fragments which include 20 
or more contiguous bases from the nucleotide sequence of the deposited cDNA or the 
nucleotide sequence as shown in SEQ ID NO: 12. In this context "about" includes the 
particularly recited size, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at 
either terminus or at both termini. Representative examples of TIDE polynucleotide 

15 fragments of the invention include, for example, fragments that comprise, or 

alternatively, consist of, a sequence from about nucleotide 1 to about 50, from about 51 
to about 100, from about 101 to about 150, from about 151 to about 200, from about 201 
to about 250, from about 251 to about 300, from about 301 to about 350, from about 351 
to about 400, from about 401 to about 450, from about 451 to about 500, from about 501 

20 to about 550, from about 55 1 to about 600, from about 601 to about 650, from about 65 1 
to about 700, from about 701 to about 750, from about 751 to about 800, from about 801 
to about 850, from about 851 to about 900, from about 901 to about 950, from about 951 
to about 1000, from about 1001 to about 1050, from about 1051 to about 1100, from 
about 1 101 to about 1 150, from about 1 151 to about 1200, from about 1201 to about 

25 1250, from about 1251 to about 1300, from about 1301 to about 1350, from about 1351 
to about 1400, from about 1401 to about 1450, from about 1451 to about 1500, from 
about 1501 to about 1550, from about 1551 to about 1600, from about 1601 to about 
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1650, from about 1651 to about 1700, from about 1701 to about 1750, from about 1751 
to about 1800, from about 1801 to about 1850, from about 1851 to about 1900, from 
about 1901 to about 1950, from about 1951 to about 2000, from about 2001 to about 
2050, from about 205 1 to about 2100, from about 2101 to about 2150, from about 2151 
to about 2200, from about 2201 to about 2250, from about 2251 to about 2300, from 
about 2301 to about 2350, from about 2351 to about 2400, from about 2401 to about 
2450, from about 245 1 to about 2499, from about 289 to about 1705, and/or from about 
221 to about 1705 of SEQ ID NO: 12, or the complementary strand thereto, or the cDNA 
contained in the deposited gene. In this context "about" includes the particularly recited 
ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at 
both termini. 

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 
alternatively, consisting of, the mature TIDE protein (amino acid residues from about 
221 to about 1705 in Figures 4A-C (amino acids from about 221 to about 1705 in SEQ 
ID NO:30). Since the location of these domains have been predicted by computer 
analysis, one of ordinary skill would appreciate that the amino acid residues constituting 
these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending 
on the criteria used to define each domain. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of TIDE. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of TIDE. The data 
representing the structural or functional attributes of TIDE set forth in Figure 6 and/or 
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Table II, as described above, was generated using the various modules and algorithms of 
the DNA*STAR set on default parameters. In a preferred embodiment, the data 
presented in columns VIII, IX, XIII, and XIV of Table II can be used to determine 
regions of TIDE which exhibit a high degree of potential for antigenicity. Regions of 
high antigenicity are determined from the data presented in columns VIII, IX, XIII, 
and/or XIV by choosing values which represent regions of the polypeptide which are 
likely to be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 6, but may, as 
shown in Table II, be represented or identified by using tabular representations of the 
data presented in Figure 6. The DNA*STAR computer algorithm used to generate Figure 
6 (set on the original default parameters) was used to present the data in Figure 6 in a 
tabular format (See Table II). The tabular format of the data in Figure 6 is used to easily 
determine specific boundaries of a preferred region. The above-mentioned preferred 
regions set out in Figure 6 and in Table II include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures 
4A-C. As set out in Figure 6 and in Table II, such preferred regions include Garnier- 
Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha- 
regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and Hopp- 
Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, Karpius- 
Schulz flexible regions, Jameson-Wolf regions of high antigenic index and Emini 
surface-forming regions. Even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened TIDE 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptides generally will be retained when less than the majority of the 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

31 

residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that an TIDE mutein 
5 with a large number of deleted N-terminal amino acid residues may retain some 

biological or immunogenic activities. In fact, peptides composed of as few as six TIDE 
amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the TIDE amino acid sequence shown 
1 0 in Figures 4A-C, up to the leucine residue at position number 489 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n 1-494 of Figures 4A-C, where nl is an 
integer from 2 to 489 corresponding to the position of the amino acid residue in Figures 
4A-C (which is identical to the sequence shown as SEQ ID NO:30). In another 

15 embodiment, N-terminal deletions of the TIDE polypeptide can be described by the 
general formula n2-494, where n2 is a number from 2 to 489, corresponding to the 
position of amino acid identified in Figures 4A-C. N-terminal deletions of the TIDE 
polypeptide of the invention shown as SEQ ID NO:30 include polypeptides comprising 
the amino acid sequence of residues: N-terminal deletions of the TIDE polypeptide of the 

20 invention shown as SEQ ID NO:30 include polypeptides comprising the amino acid 
sequence of residues: R-2 to P-494; P-3 to P-494; P-4 to P-494; G-5 to P-494; F-6 to P- 
494; R-7 to P-494; N-8 to P-494; F-9 toP-494; L-10 to P-494; L-l 1 to P-494; L-12 to P- 
494; A-13 to P-494; S-14 to P-494; S-15 to P-494; L-16 to P-494; L-17 to P-494;F-18 to 
P-494; A-19 to P-494; G-20 to P-494; L-21 to P-494; S-22 to P-494; A-23 to P-494; V- 

25 24 to P-494; P-25 to P-494; Q-26to P-494; S-27 to P-494; F-28 to P-494; S-29 to P-494; 
P-30 to P-494; S-31 to P-494; L-32 to P-494; R-33 to P-494; S-34 toP-494; W-35 to P- 
494; P-36 to P-494; G-37 to P-494; A-38 to P-494; A-39 to P-494; C-40 to P-494; R-41 
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to P-494; L-42 toP-494; S-43 to P-494; R-44 to P-494; A-45 to P-494; E-46 to P-494; S- 
47 to P-494; E-48 to P-494; R-49 to P-494; R-50 toP-494; C-51 to P-494; R-52 to P-494; 
A-53 to P-494; P-54 to P-494; G-55 to P-494; Q-56 to P-494; P-57 to P-494; P-58 toP- 
494; G-59 to P-494; A-60 to P-494; A-61 to P-494; L-62 to P-494; C-63 to P-494; H-64 
5 to P-494; G-65 to P-494; R-66 toP-494; G-67 to P-494; R-68 to P-494; C-69 to P-494; D- 
70 to P-494; C-71 to P-494; G-72 to P-494; V-73 to P-494; C-74 toP-494; 1-75 to P-494; 
C-76 to P-494; H-77 to P-494; V-78 to P-494; T-79 to P-494; E-80 to P-494; P-81 to P- 
494; G-82 to P-494;M-83 to P-494; F-84 to P-494; F-85 to P-494; G-86 to P-494; P-87 to 
P-494; L-88 to P-494; C-89 to P-494; E-90 to P-494; C-91to P-494; H-92 to P-494; E-93 

10 to P-494: W-94 to P-494; V-95 to P-494; C-96 to P-494; E-97 to P-494; T-98 to P-494; 
Y-99 toP-494; D-100 to P-494; G-101 to P-494; S-102 to P-494; T-103 to P-494: C-104 
to P-494; A- 105 to P-494; G-106 to P-494;H-107 to P-494; G-108 to P-494; K-109 to P- 
494; C-l 10 to P-494; D-l 11 to P-494; C-112 to P-494; G-l 13 to P-494; K-l 14 toP-494; 
C-l 15 to P-494: K-l 16 to P-494; C-l 17 to P-494; D-l 18 to P-494; Q-l 19 to P-494; G- 

15 120 to P-494; W-121 to P-494;Y-122 to P-494; G-123 to P-494; D-124 to P-494; A-125 
to P-494; C-l 26 to P-494; Q-l 27 to P-494; Y-128 to P-494; P-129 toP-494; T-130 to P- 
494; N-131 to P-494; C-132 to P-494; D-133 to P-494; L-134 to P-494; T-135 to P-494; 
K-136 to P-494;K-137 to P-494; K-138 to P-494; S-139 to P-494; N-140 to P-494; Q- 
141 to P-494; M-142 to P-494; C-143 to P-494; K-144 toP-494; N-145 to P-494; S-146 

20 to P-494; Q-147 to P-494; D-148 to P-494; 1-149 to P-494; 1-150 to P-494; C-151 to P- 
494; S-152to P-494; N-153 to P-494; A-154 to P-494; G-155 to P-494; T-156 to P-494; 
C-157 to P-494; H-158 to P-494; C-159 to P-494;G-160 to P-494; R-161 to P-494; C-162 
to P-494; K-163 to P-494; C-164 to P-494; D-165 to P-494; N-166 to P-494; S-167 toP- 
494; D-168 to P-494; G-169 to P-494; S-170 to P-494; G-I71 to P-494; L-172 to P-494; 

25 V-173 to P-494; Y-174 to P-494;G-175 to P-494; K-176 to P-494; F-177 to P-494; C-178 
to P-494; E-179 to P-494; C-180 to P-494; D-181 to P-494; D-182 toP-494; R-183 to P- 
494; E-184 to P-494; C-185 to P-494; 1-186 to P-494; D-187 to P-494; D-188 to P-494; 
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E-189 to P-494; T-190to P-494; E-191 to P-494; E-192 to P-494; 1-193 to P-494; C-194 
to P-494; G-195 to P-494; G-196 to P-494; H-197 to P-494;G-198 to P-494; K-199 to P- 
494; C-200 to P-494; Y-201 to P-494; C-202 to P-494; G-203 to P-494; N-204 to P-494; 
C-205 toP-494; Y-206 to P-494; C-207 to P-494; K-208 to P-494; A-209 to P-494; G- 
5 210 to P-494; W-21 1 to P-494; H-212 to P-494;G-213 to P-494; D-214 to P-494; K-215 
to P-494; C-216 to P-494; E-217 to P-494; F-218 to P-494; Q-219 to P-494; C-220 toP- 
494; D-221 to P-494; 1-222 to P-494; T-223 to P-494; P-224 to P-494; W-225 to P-494; 
E-226 to P-494; S-227 to P-494;K-228 to P-494; R-229 to P-494; R-230 to P-494; C-231 
to P-494; T-232 to P-494; S-233 to P-494; P-234 to P-494; D-235 toP-494; G-236 to P- 

10 494; K-237 to P-494; 1-238 to P-494; C-239 to P-494; S-240 to P-494; N-241 to P-494; 
R-242 to P-494;G-243 to P-494; T-244 to P-494; C-245 to P-494; V-246 to P-494; C-247 
to P-494; G-248 to P-494; E-249 to P-494; C-250 toP-494; T-251 to P-494; C-252 to P- 
494; H-253 to P-494; D-254 to P-494; V-255 to P-494; D-256 to P-494; P-257 to P- 
494;T-258 to P-494; G-259 to P-494; D-260 to P-494; W-261 to P-494; G-262 to P-494; 

15 D-263 to P-494; 1-264 to P-494; H-265 toP-494; G-266 to P-494; D-267 to P-494; T-268 
to P-494; C-269 to P-494; E-270 to P-494; C-271 to P-494; D-272 to P-494;E-273 to P- 
494; R-274 to P-494; D-275 to P-494; C-276 to P-494; R-277 to P-494; A-278 to P-494; 
V-279 to P-494; Y-280 toP-494; D-281 to P-494; R-282 to P-494; Y-283 to P-494; S-284 
to P-494; D-285 to P-494; D-286 to P-494; F-287 to P-494;C-288 to P-494; S-289 to P- 

20 494; G-290 to P-494; H-291 to P-494; G-292 to P-494; Q-293 to P-494; C-294 to P-494; 
N-295 toP-494; C-296 to P-494; G-297 to P-494; R-298 to P-494; C-299 to P-494; D- 
300 to P-494; C-301 to P-494; K-302 to P-494;A-303 to P-494; G-304 to P-494; W-305 
to P-494; Y-306 to P-494; G-307 to P-494; K-308 to P-494; K-309 to P-494; C-310 toP- 
494; E-311 to P-494; H-312 to P-494; P-313 to P-494; Q-314 to P-494; S-315 to P-494; 

25 C-316 to P-494; T-317 to P-494;L-318 to P-494; S-319 to P-494; A-320 to P-494; E-321 
to P-494; E-322 to P-494; S-323 to P-494; 1-324 to P-494; R-325 toP-494; K-326 to P- 
494; C-327 to P-494; Q-328 to P-494; G-329 to P-494; S-330 to P-494; S-331 to P-494; 
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D-332 to P-494;L-333 to P-494; P-334 to P-494; C-335 to P-494; S-336 to P-494; G-337 
to P-494; R-338 to P-494; G-339 to P-494; K-340 toP-494; C-341 to P-494; E-342 to P- 
494; C-343 to P-494; G-344 to P-494; K-345 to P-494; C-346 to P-494; T-347 to P- 
494;C-348 to P-494; Y-349 to P-494; P-350 to P-494; P-351 to P-494; G-352 to P-494; 
5 D-353 to P-494; R-354 to P-494; R-355 toP-494; V-356 to P-494; Y-357 to P-494; G- 
358 to P-494; K-359 to P-494; T-360 to P-494; C-361 to P-494; E-362 to P-494;C-363 to 
P-494; D-364 to P-494; D-365 to P-494; R-366 to P-494; R-367 to P-494; C-368 to P- 
494; E-369 to P-494; D-370 toP-494; L-371 to P-494; D-372 to P-494; G-373 to P-494; 
V-374 to P-494; V-375 to P-494; C-376 to P-494; G-377 to P-494;G-378 to P-494; H- 

10 379 to P-494; G-380 to P-494; T-381 to P-494; C-382 to P-494; S-383 to P-494; C-384 
to P-494; G-385 toP-494; R-386 to P-494; C-387 to P-494; V-388 to P-494; C-389 to P- 
494; E-390 to P-494; R-391 to P-494; G-392 to P-494;W-393 to P-494; F-394 to P-494; 
G-395 to P-494; K-396 to P-494; L-397 to P-494; C-398 to P-494; Q-399 to P-494; H- 
400 toP-494; P-401 to P-494; R-402 to P-494; K-403 to P-494; C404 to P-494; N-405 to 

15 P-494; M-406 to P-494; T-407 to P-494;E-408 to P-494; E-409 to P-494; Q-410 to P- 
494; S-4J 1 to P-494; K-412 to P-494; N-413 to P-494; L-414 to P-494; C-415 toP-494; 
E-416 to P-494; S-417 to P-494; A-418 to P-494; D-419 to P-494; G-420 to P-494; 1-421 
to P-494; L-422 to P-494; C-423to P-494; S-424 to P-494; G-425 to P-494; K-426 to P- 
494; G-427 to P-494; S-428 to P-494; C-429 to P-494; H-430 to P-494;C-431 to P-494; 

20 G-432 to P-494; K-433 to P-494; C-434 to P-494; 1-435 to P-494; C-436 to P-494; S-437 
to P-494; A-438 toP-494; E-439 to P-494; E-440 to P-494; W-441 to P-494; Y-442 to P- 
494; 1-443 to P-494; S-444 to P-494; G-445 to P-494;E-446 to P-494; F-447 to P-494; C- 
448 to P-494; D-449 to P-494; C-450 to P-494; D-451 to P-494; D-452 to P-494; R-453 
toP-494; D-454 to P494; C-455 to P-494; D-456 to P-494; K-457 to P-494; H-458 to P- 

25 494; D-459 to P-494; G-460 to P-494;L-461 to P-494; 1-462 to P-494; C-463 to P-494; 
T-464 to P-494; G-465 to P-494; N-466 to P-494; G-467 to P-494; 1-468 toP-494; C-469 
to P-494; S-470 to P-494; C-471 to P-494; G-472 to P-494; N-473 to P-494; C-474 to P- 
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494; E-475 to P-494;C-476 to P-494; W-477 to P-494; D-478 to P-494; G-479 to P-494; 
W-480 to P-494; N-481 to P-494; G-482 to P-494; N-483 toP-494; A-484 to P-494; C- 
485 to P-494; E-486 to P-494; 1-487 to P-494; W-488 to P-494; L-489 to P-494; of SEQ 
ID NO:30. Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities may still be retained. For example the 
ability of the shortened TIDE mutein to induce and/or bind to antibodies which recognize 
the complete or mature forms of the polypeptide generally will be retained when less 
than the majority of the residues of the complete or mature polypeptide are removed 
from the C-terminus. Whether a particular polypeptide lacking C-terminal residues of a 
complete polypeptide retains such immunologic activities can readily be determined by 
routine methods described herein and otherwise known in the art. It is not unlikely that 
an TIDE mutein with a large number of deleted C-terminal amino acid residues may 
retain some biological or immunogenic activities. In fact, peptides composed of as few as 
six TIDE amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the TIDE 
polypeptide shown in Figures 4A-C, up to the phenylalanine residue at position number 
6, and polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-ml of Figure 1, 
where ml is an integer from 6 to 494 corresponding to the position of the amino acid 
residue in Figures 4A-C. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of C- 
terminal deletions of the TIDE polypeptide of the invention shown as SEQ ID NO:30 
include polypeptides comprising the amino acid sequence of residues: M-l to Y-493; M- 
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1 to E-492; M-l to S-491; M-l to G-490; M-l to L-489; M-l to W-488; M-l toI-487; M- 
1 to E-486; M-l to C-485; M-l to A-484; M-l to N-483; M-l to G-482; M-l to N-481; 
M-l to W-480; M-l to G-479;M-1 to D-478; M-l to W-477; M-l to C-476; M-l to E- 
475; M-l to C-474; M-l to N-473; M-l to G-472; M-l to C-471; M-l toS-470; M-l to 
5 C-469; M-l to 1-468; M-l to G-467; M-l to N-466; M-l to G-465; M-l to T-464; M-l to 
C-463; M-l to 1-462; M-lto L-461; M-l to G-460; M-l to D-459; M-l to H-458; M-l to 
K-457; M-l to D-456; M-l to C-455; M-l to D-454; M-l to R-453;M-1 to D-452: M-l 
to D-451; M-l to C-450; M-l to D-449; M-l to C-448; M-l to F-447; M-l to E-446; M- 
1 to G-445; M-l toS-444; M-l to 1-443; M-l to Y-442; M-l to W-441; M-l to E-440; M- 

10 1 to E-439; M-l to A-438; M-l to S-437; M-l to C-436; M-lto 1-435; M-l to C-434; M- 
1 to K-433; M-l to G-432; M-l to C-431; M-l to H-430; M-l to C-429; M-l to S-428; 
M-l to G-427;M-1 to K-426; M-l to G-425; M-l to S-424; M-l to C-423; M-l to L-422; 
M-l to 1-421; M-l to G-420; M-l toD-419; M-l toA-418; M-l to S-417; M-l toE-416; 
M-l to C-415; M-l to L-414; M-l to N-413; M-l to K-412; M-l to S-411; M-l to Q- 

15 410;M-1 to E-409; M-l to E-408; M-l to T-407; M-l to M-406; M-l to N-405; M-l to 
C-404; M-l to K-403; M-l to R-402; M-l toP-401; M-l to H-400; M-l to Q-399; M-l to 
C-398; M-l to L-397; M-l to K-396; M-l to G-395; M-l to F-394; M-l to W-393;M-1 
to G-392; M-l to R-391; M-l to E-390; M-l to C-389; M-l to V-388; M-l to C-387; M- 
1 to R-386; M-l to G-385; M-l toC-384; M-l to S-383; M-l to C-382; M-l to T-381; M- 

20 1 to G-380; M-l to H-379; M-l to G-378; M-l to G-377; M-l to C-376;M-1 to V-375; 
M-l to V-374; M-l to G-373; M-l to D-372; M-l to L-371; M-l to D-370; M-l to E- 
369; M-l to C-368; M-l toR-367; M-l to R-366; M-l to D-365; M-l to D-364; M-l to 
C-363; M-l to E-362; M-l to C-361; M-l to T-360; M-l to K-359;M-1 to G-358; M-l to 
Y-357; M-l to V-356; M-l to R-355; M-l to R-354; M-l to D-353; M-l to G-352; M-l 

25 to P-351; M-l toP-350; M-l to Y-349; M-l to C-348; M-l to T-347; M-l to C-346; M-l 
to K-345; M-l to G-344; M-l to C-343; M-l to E-342;M-1 to C-341; M-l to K-340; M-l 
to G-339; M-l to R-338; M-l to G-337; M-l to S-336; M-l to C-335; M-l to P-334; M-l 
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toL-333; M-l to D-332; M-l to S-331 ; M-l to S-330; M-l to G-329; M-l to Q-328; M-l 
to C-327; M-l to K-326; M-l to R-325;M-1 to 1-324; M-l to S-323; M-l to E-322; M-l 
to E-321; M-l to A-320; M-l to S-319; M-l to L-318; M-l to T-317; M-l toC-316; M-l 
to S-315; M-l to Q-314; M-l to P-313; M-l to H-312; M-l to E-31 1; M-l to C-310; M-l 
to K-309; M-l to K-308;M-1 to G-307; M-l to Y-306; M-l to W-305; M-l to G-304; M- 
1 to A-303; M-l to K-302; M-l to C-301; M-l to D-300; M-l toC-299; M-l to R-298; 
M-l to G-297; M-l to C-296; M-l to N-295; M-l to C-294; M-l to Q-293; M-l to G- 
292; M-l to H-291;M-1 to G-290; M-l to S-289; M-l to C-288; M-l to F-287; M-l to 
D-286; M-l to D-285; M-l to S-284; M-l to Y-283; M-l toR-282; M-l to D-281; M-l to 
Y-280; M-l to V-279; M-l to A-278; M-l to R-277; M-l to C-276; M-l to D-275; M-l 
toR-274;M-l toE-273;M-l toD-272;M-l toC-271;M-l toE-270; M-l toC-269;M-l 
to T-268; M-l to D-267; M-l to G-266; M-l toH-265; M-l to 1-264; M-l to D-263; M-l 
to G-262; M-l to W-261; M-l to D-260; M-l to G-259; M-l to T-258; M-l to P-257;M- 
1 to D-256; M-l to V-255; M-l to D-254; M-l to H-253; M-l to C-252; M-l to T-251; 
M-l to C-250; M-l to E-249; M-l toG-248; M-l to C-247; M-l to V-246; M-l to C-245; 
M-l to T-244; M-l to G-243; M-l to R-242; M-l to N-241; M-l to S-240;M-1 to C-239; 
M-l to 1-238; M-l to K-237; M-l to G-236; M-l to D-235; M-l to P-234; M-l to S-233; 
M-l to T-232; M-l toC-231; M-l to R-230; M-l to R-229; M-l to K-228; M-l to S-227; 
M-l to E-226; M-l to W-225; M-l to P-224; M-l to T-223;M-1 to 1-222; M-l to D-221; 
M-l to C-220; M-l to Q-219; M-l to F-218; M-l to E-217; M-l to C-216; M-l to K- 
215; M-l toD-214; M-l to G-213; M-l to H-212; M-l to W-211; M-l to G-210; M-l to 
A-209; M-l to K-208; M-l to C-207; M-l to Y-206;M-1 to C-205; M-l to N-204; M-l 
to G-203; M-l to C-202; M-l to Y-201; M-l to C-200; M-l to K-199; M-l to G-198; M- 
1 toH-197; M-l to G-196; M-l to G-195; M-l to C-194; M-l to 1-193; M-l to E-192; M- 
1 toE-191;M-l toT-190; M-l toE-189; M-lto D-188; M-l toD-187;M-l to 1-186; M- 
1 to C-185; M-l to E-184; M-l to R-183; M-l to D-182; M-l to D-181; M-l to C- 
180;M-1 to E-179; M-l to C-178; M-l to F-177; M-l to K-176; M-l to G-175; M-l to 
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Y-174; M-l to V-173; M-l to L-172; M-l toG-171; M-l to S-170; M-l to G-169; M-l to 
D-168; M-l to S-167; M-l to N-166; M-l to D-165; M-l to C-164; M-l to K-163;M-1 to 
C-162; M-l to R-161; M-l to G-160; M-l to C-159; M-l to H-158; M-l to C-157; M-l 
to T-156; M-l to G-155; M-l toA-154; M-l to N-153; M-l to S-152; M-l to C-151; M-l 
5 to 1-150; M-l to 1-149; M-l to D-148; M-l to Q-147; M-l to S-146; M-lto N-145; M-l 
to K-144; M-l to C-143; M-l to M-142; M-l to Q-141; M-l to N-140; M-l to S-139: M- 
1 to K-138; M-l to K-137;M-1 to K-136; M-l toT-135; M-l to L-134; M-l to D-133; 
M-l toC-132;M-l toN-131;M-l toT-130;M-l toP-129;M-l toY-128;M-l toQ-127; 
M-l toC-126;M-l to A-125:M-1 toD-124; M-l toG-123;M-l to Y-122; M-l to W- 

10 121; M-l toG-120;M-l to Q-119; M-l toD-1 18: M-l toC-117;M-l toK-116;M-l to 
C-l 15; M-l to K-l 14; M-l to G-113; M-l to C-l 12; M-l toD-111; M-l to C-110; M-l to 
K-109; M-l to G-108; M-l to H-107; M-l to G-106; M-l to A-105; M-l to C-104; M-l 
to T-103;M-1 to S-102; M-l to G-101; M-l to D-100; M-l to Y-99; M-l to T-98; M-l to 
E-97; M-l to C-96; M-l to V-95; M-l to W-94;M-1 to E-93; M-l to H-92; M-l to C-91; 

15 M-l to E-90; M-l to C-89; M-l to L-88; M-l to P-87; M-l to G-86; M-l to F-85; M-l 
toF-84; M-l to M-83; M-l to G-82; M-l to P-81; M-l to E-80; M-l to T-79; M-l to V- 
78; M-l to H-77; M-l to C-76; M-l to I-75;M-1 to C-74; M-l to V-73; M-l to G-72; M- 
1 to C-71; M-l to D-70; M-l to C-69; M-l to R-68; M-l to G-67; M-l to R-66; M-l 
toG-65; M-l to H-64; M-l to C-63; M-l to L-62; M-l to A-61; M-l to A-60; M-l to G- 

20 59; M-l to P-58; M-l to P-57; M-l to Q-56;M-1 to G-55; M-l to P-54; M-l to A-53; M- 
1 to R-52; M-l to C-51; M-l to R-50; M-l to R-49; M-l to E-48; M-l to S-47; M-l toE- 
46; M-l to A-45; M-l to R-44; M-l to S-43; M-l to L-42; M-l to R-41; M-l to C-40; M- 
1 to A-39; M-l to A-38; M-l to G-37;M-1 to P-36; M-l to W-35; M-l to S-34; M-l to 
R-33; M-l to L-32; M-l to S-31; M-l to P-30; M-l to S-29; M-l to F-28; M-l toS-27; 

25 M-l to Q-26; M-l to P-25; M-l to V-24; M-l to A-23; M-l to S-22; M-l to L-21; M-l to 
G-20; M-l toA-19;M-l toF-18;M-l toL-17;M-l toL-16;M-l toS-15;M-l toS-14; 
M-l to A-13;M-1 toL-12; M-l toL-ll;M-l toL-10;M-l toF-9;M-l toN-8; M-l toR- 
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7; M-i to F-6; of SEQ ID NO:30. Polypeptides encoded by these polynucleotides are 
also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:12 which have been determined 
from the following related cDNA genes: HLHFV34R (SEQ ID NO:68), HSRDA85R 
(SEQ ID NO:69), HSRAZ62R (SEQ ID NO:70), HSRDA17R (SEQ ID NO:71), and 
HSLEC45R (SEQ ID NO:72). 

Based on the sequence similarity to the human integrin beta-8 subunit, translation 
product of this gene is expected to share at least some biological activities with integrin 
proteins, and specifically the human integrin beta-8 subunit. Such activities are known in 
the art, some of which are described elsewhere herein. 

Specifically, polynucleotides and polypeptides of the invention are also useful for 
modulating the differentiation of normal and malignant cells, modulating the 
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the 
immune response. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for hematopoietic and immune diseases and/or disorders. The full- 
length protein should be a secreted protein, based upon homology to the integrin family. 
Therefore, it is secreted into serum, urine, or feces and thus the levels is assayabie from 
patient samples. Assuming specific expression levels are reflective of the presence of 
immune disorders, this protein would provide a convenient diagnostic for early detection. 
In addition, expression of this gene product may also be linked to the progression of 
immune diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of cancer. Polynucleotides and polypeptides of the invention may 
play an important role in the pathogenesis of human cancers and cellular transformation, 
particularly those of the immune and hematopoietic systems. Polynucleotides and 
polypeptides of the invention may also be involved in the pathogenesis of developmental 
abnormalities based upon its potential effects on proliferation and differentiation of cells 
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and tissue cell types. Due to the potential proliferating and differentiating activity of said 
polynucleotides and polypeptides, the invention is useful as a therapeutic agent in 
inducing tissue regeneration, for treating inflammatory conditions (e.g., inflammatory 
bowel syndrome, diverticulitis, etc.). Moreover, the invention is useful in modulating the 
immune response to aberrant polypeptides, as may exist in rapidly proliferating cells and 
tissue cell types, particularly in adenocarcinoma cells, and other cancers. 

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Alternatively, this gene product is involved in the pattern of cellular proliferation 
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in 
tissues - particularly adult tissues - may correlate with patterns of abnormal cellular 
proliferation, such as found in various cancers. Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control cell 
and tissue type specification. Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein may 
modulate apoptosis or tissue differentiation and is useful in the detection, treatment, 
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and/or prevention of degenerative or proliferative conditions and diseases. The protein is 
useful in modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as tissue markers, 
to isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 

The translation product of this gene, sometimes referred to herein as TIDE (for 
Ten Integrin Domains with EGF homology), shares sequence homology with integrins, 
which are a superfamily of dimeric ab cell-surface glycoproteins that mediate the 
adhesive functions of many cell types, enabling cells to interact with one another and 
with the extracellular matrix (See Genomics 56, 169-178 (1999); all information and 
references contained within this publication are hereby incorporated herein by reference). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 13, 
at locus 13q33. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 13, generally, and particularly at locus 
13q33. 

This gene is expressed primarily in synovial hypoxia tissue, osteoblast and 
osteoclast, bone marrow stromal cells, and to a lesser extent in umbilical vein, smooth 
muscle, placenta, and fetal lung cDNA libraries. Therefore, polynucleotides and 
polypeptides of the invention are useful as reagents for differential identification of the 
tissue(s) or cell type(s) present in a biological sample and for diagnosis of diseases and 
conditions which include, but are not limited to, disorders of bone and connective tissues, 
immune and hematopoietic diseases and/or disorders, vascular disorders, and other 
disorders involving aberrations in cell-surface interactions. Similarly, polypeptides and 
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antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the connective tissue and skeletal system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g. cartilage, bone, vascular, hypoxic tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 30 as residues: Met-1 to Phe-6, Arg-44 to Arg-52, His-64 to Cys- 
69, Tyr-99 to Gln-147, His-158 to Gly-169, Phe-177 to Asp-182, Cys-194 to Cys-202, 
Gly-213 to Phe-218, Pro-224 to Gly-236, Asp-254 to Trp-261, Asp-263 to Ala-303, Trp- 
305 to Cys-316, Lys-326 to Asp-332, Pro-334 to Cys-343, Pro-350 to Asp-370, Thr-407 
to Asn-413, Gly-425 to Cys-431, Asp-449 to Asp-459, Gly-472 to Asn-483. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution and homology to the human integrin beta-8 subunit 
indicates polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of a variety of immune system disorders. Representative uses are 
described in the "Immune Activity" and "infectious disease" sections below, in Example 
11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. 

Briefly, the expression indicates a role in regulating the proliferation; survival; 
differentiation; and/or activation of hematopoietic cell lineages, including blood stem 
cells. Involvement in the regulation of cytokine production, antigen presentation, or other 
processes indicates a usefulness for treatment of cancer (e.g. by boosting immune 
responses). Expression in cells of lymphoid origin, indicates the natural gene product is 
involved in immune functions. Therefore it would also be useful as an agent for 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

43 

immunological disorders including arthritis, asthma, immunodeficiency diseases such as 
AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory bowel 
disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T- 
cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, such as 
host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lense tissue injury, demyelination, systemic lupus erythematosis, 
drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's Disease, and 
scleroderma. Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. Thus, this gene product is thought to be useful in the expansion of stem 
cells and committed progenitors of various blood lineages, and in the differentiation 
and/or proliferation of various cell types. Based upon the tissue distribution of this 
protein, antagonists directed against this protein is useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. 

Furthermore, the protein may also be used to determine biological activity, raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. Protein, 
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as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
5 ID NO: 12 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
10 b, where a is any integer between 1 to 2485 of SEQ ID NO: 12, b is an integer of 15 to 
2499, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 12, and where b is greater than or equal to a + 14. 



1 5 FEATURES OF PROTEIN ENCODED BY GENE NO: 3 



The translation product of this gene shares sequence homology with RAMP3 
(receptor-activity-modifying proteins), which another group has recently published, 
which is thought to be important in the transport of the calcitonin-receptor-Iike receptor 

20 (CRLR) to the plasma membrane. RAMPs regulate the transport and ligand specificity of 
the calcitonin-receptor-like-receptor. There are two other related receptor-activity- 
modifying proteins, known as RAMP1 and RAMP2 (Nature 1998 May 
28;393(6683):333-9). RAMP1 is thought to present the receptor at the cell surface as a 
mature glycoprotein and a Calcitonin-gene-related peptide (CGRP) receptor. 

25 Alternatively, RAMP2-tran sported receptors are core-glycosylated and are 

adrenomedullin receptors. CGRP (a 37-amino-acid neuropeptide) and its receptors are 
widely distributed in the body, and it is the most potent endogenous vasodilatory peptide 
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discovered so far (Crit Rev Neurobiol 1997;1 1(2-3): 167-239). Specific binding sites for 
adrenomedullin were present in every region of human brain (cerebral cortex, 
cerebellum, thalamus, hypothalamus, pons and medulla oblongata), suggesting that a 
novel neurotransmitter/neuromodulator role may exist for adrenomedullin in human 
5 brain (Peptides 1997;18(8): 1 125-9). 

Figures 7A-B show the nucleotide (SEQ ID NO: 13) and deduced amino acid 
sequence (SEQ ID NO:31) of the Intestine derived extracellular protein. Predicted amino 
acids from about 1 to about 27 constitute the predicted signal peptide (amino acid 
residues from about 1 to about 27 in SEQ ID NO:31) and are represented by the 
10 underlined amino acid regions; and amino acids from about 122 to about 138 constitute 
the predicted transmembrane domain (amino acid residues from about 122 to about 138 
in SEQ ID NO:31) and are represented by the double-underlined amino acids. 

Figure 8 shows the regions of similarity between the amino acid sequences of the 
Intestine derived extracellular protein SEQ ID NO:31, and the RAMP3 protein 
15 (gi|4587099) (SEQ ID NO: 75). 

Figure 9 shows an analysis of the amino acid sequence of SEQ ID NO: 31. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 
Northern analysis indicates that a 1.4kb transcript of this gene is primarily 
20 expressed in small intestine tissue, and to a lesser extent in colon and prostate tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figure 1 (SEQ ID NO:31), which was determined by sequencing a cloned cDNA 
(HTLEW81). The nucleotide sequence shown in Figures 7A-B (SEQ ID NO: 13) was 
25 obtained by sequencing a cloned cDNA (HTLEW81), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
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using the Sall/NotI restriction endonuclease cleavage sites. The present invention is 
further directed to fragments of the isolated nucleic acid molecules described herein. By 
a fragment of an isolated DNA molecule having the nucleotide sequence of the deposited 
cDNA or the nucleotide sequence shown in SEQ ID NO: 13 is intended DNA fragments 
5 at least about 15nt, and more preferably at least about 20 nt, still more preferably at least 
about 30 nt, and even more preferably, at least about 40 nt in length which are useful as 
diagnostic probes and primers as discussed herein. Of course, larger fragments 50-1500 
nt in length are also useful according to the present invention, as are fragments 
corresponding to most, if not all, of the nucleotide sequence of the deposited cDNA or as 

10 shown in SEQ ID NO: 13. By a fragment at least 20 nt in length, for example, is intended 
fragments which include 20 or more contiguous bases from the nucleotide sequence of 
the deposited cDNA or the nucleotide sequence as shown in SEQ ID NO: 13. In this 
context "about" includes the particularly recited size, larger or smaller by several (5, 4, 3, 
2, or 1) nucleotides, at either terminus or at both termini. Representative examples of 

15 polynucleotide fragments of the invention include, for example, fragments that comprise, 
or alternatively, consist of, a sequence from about nucleotide 1 to about 50, from about 
51 to about 100, from about 101 to about 150, from about 151 to about 200, from about 
201 to about 250, from about 251 to about 300, from about 301 to about 350, from about 
351 to about 400, from about 401 to about 450, from about 451 to about 500, and from 

20 about 501 to about 550, and from about 551 to about 600, from about 601 to about 650, 
from about 651 to about 700, from about 701 to about 750, from about 751 to about 800, 
from about 801 to about 850, from about 851 to about 900, from about 901 to about 950, 
from about 951 to about 1000, from about 1001 to about 1050, from about 1051 to about 
1 100, from about 1 101 to about 1 150, from about 1 151 to about 1200, from about 1201 

25 to about 1250, from about 1251 to about 1300, from about 1301 to about 1339 of SEQ 
ID NO: 13, or the complementary strand thereto, or the cDNA contained in the deposited 
gene. In this context "about" includes the particularly recited ranges, larger or smaller by 
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several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In additional 
embodiments, the polynucleotides of the invention encode functional attributes of the 
corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
5 comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and lurn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
10 the structural or functional attributes of the protein set forth in Figure 9 and/or Table 1IL 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table 1111 can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
15 antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 9, but may, as 
20 shown in Table III, be represented or identified by using tabular representations of the 
data presented in Figure 9. The DNA*STAR computer algorithm used to generate Figure 
9 (set on the original default parameters) was used to present the data in Figure 9 in a 
tabular format (See Table III). The tabular format of the data in Figure 9 is used to easily 
determine specific boundaries of a preferred region. The above-mentioned preferred 
25 regions set out in Figure 9 and in Table III include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures 
7A-B. As set out in Figure 9 and in Table III, such preferred regions include Garnier- 
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Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha- 
regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and Hopp- 
Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, Karplus- 
Schulz flexible regions, Jameson-Wolf regions of high antigenic index and Emini 
5 surface-forming regions. Even if deletion of one or more amino acids from the N- 

terminus of a protein results in modification of loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 

10 polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 

15 deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 

20 Figures 7A-B, up to the arginine residue at position number 143 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues ni-148 of Figures 7A-B, where nl is an 
integer from 2 to 143 corresponding to the position of the amino acid residue in Figures 
7A-B (which is identical to the sequence shown as SEQ ID NO:31). N-terminal deletions 

25 of the polypeptide of the invention shown as SEQ ID NO:31 include polypeptides 

comprising the amino acid sequence of residues: E-2 to L-148; T-3 to L-148; G-4 toL- 
148; A-5 to L-148; L-6 to L-148; R-7 to L-148; R-8 to L-148;P-9 to L-148; Q-10 to L- 
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148; L-l 1 to L-148; L-12 to L-148; P-13to L-148; L-14 to L-148; L-15 to L-148; L-16 to 
L-148; L-17 toL-148; L-18 to L-148; C-19 to L-148; G-20 to L-148; G-21 toL-148; C-22 
to L-148; P-23 to L-148; R-24 to L-148; A-25 toL-148; G-26 to L-148; G-27 to L-148; 
C-28 to L-148; N-29 toL-148; E-30 to L-148; T-31 to L-148; G-32 to L-148; M-33 toL- 
5 148; L-34 to L-148; E-35 to L-148; R-36 to L-148; L-37 toL-148; P-38 to L-148; L-39 to 
L-148; C-40 to L-148; G-41 toL-148; K-42 to L-148; A-43 to L-148; F-44 to L-148; A- 
45 toL-148; D-46 to L-148; M-47 to L-148; M-48 to L-148; G-49 toL- 148; K-50 to L- 
148; V-51 to L-148; D-52 to L-148; V-53 toL-148; W-54 to L-148; K-55 to L-148; W-56 
to L-148; C-57 toL-148; N-58 to L-148; L-59 to L-148; S-60 to L-148; E-61 toL-148; F- 

10 62 to L-148; 1-63 to L-148; V-64 to L-148; Y-65 toL-148; Y-66 to L-148; E-67 to L-148; 
S-68 to L-148; F-69 toL-148; T-70 to L-148: N-71 to L-148; C-72 to L-148; T-73 toL- 
148; E-74 to L-148; M-75 to L-148; E-76 to L-148; A-77 toL-148; N-78 to L-148; V-79 
to L-148; V-80 to L-148; G-81 toL-148; C-82 to L-148; Y-83 to L-148; W-84 to L-148; 
P-85 toL-148; N-86 to L-148; P-87 to L-148; L-88 to L-148; A-89 toL-148; Q-90 to L- 

15 148; G-91 to L-148; F-92 to L-148; 1-93 toL-148; T-94 to L-148; G-95 to L-148; 1-96 to 
L-148; H-97 toL-148; R-98 to L-148; Q-99 to L-148; F-100 to L-148; F-101 toL-148; S- 
102 to L-148; N-103 to L-148; C-104 to L-148; T-105 toL-148; V-106 to L-148; D-107 
to L-148; R-108 to L-148; V-109 toL-148; H-l 10 to L-148; L-ll 1 to L-148; E-l 12 to L- 
148; D-l 13 toL-148; P-l 14 to L-148; P-l 15 to L-148; D-l 16 to L-148; E-l 17 toL-148; 

20 V-l 18 to L-148; L-l 19 to L-148; 1-120 to L-148; P-121 toL-148; L-122 to L-148; 1-123 
to L-148; V-124 to L-148; 1-125 toL-148; P-126 to L-148; V-127 to L-148; V-128 to L- 
148; L-129 toL-148; T-130 to L-148; V-131 to L-148; A-132 to L-148; M-133 toL-148; 
A-134 to L-148; G-135 to L-148; L-136 to L-148; V-137 toL-148; V-138 to L-148; W- 
139 to L-148; R-140 to L-148; S-141 toL-148; K-142 to L-148; R-143 to L-148; of SEQ 

25 ID NO:3 1 . Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 
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Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability 
to illicit mitogenic activity, induce differentiation of normal or malignant cells, bind to 
5 EGF receptors, etc.)), may still be retained. For example the ability to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 

10 readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 
Accordingly, the present invention further provides polypeptides having one or 

15 more residues deleted from the carboxy terminus of the amino acid sequence of the 

polypeptide shown in Figures 7A-B, up to the arginine residue at position number 7, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues i-ml of Figures 7A-B, 
where ml is an integer from 7 to 147 corresponding to the position of the amino acid 

20 residue in Figures 7A-B. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of C- 
terminal deletions of the polypeptide of the invention shown as SEQ ID NO:31 include 
polypeptides comprising the amino acid sequence of residues: M-l to L-147; M-l to T- 
146;M-1 to D-145; M-l to T-144; M-l to R-143; M-l to K-142; M-l toS-141; M-l to R- 

25 140; M-l to W-139; M-l to V-138; M-l to V-137;M-1 to L-136; M-l to G-135; M-l to 
A-134; M-l to M-133; M-l toA-132; M-l to V-131; M-l to T-130; M-l to L-129; M-l 
to V-128;M-1 to V-127; M-l to P-126;M-1 to 1-125; M-l to V-124; M-l toI-123;M-l to 
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L-122;M-1 toP-121;M-l to 1-120; M-l toL-119;M-l toV-118;M-l toE-117;M-l to 
D-116; M-l toP-115;M-l toP-114; M-l toD-113;M-l toE-112;M-l toL-lll;M-l to 
H-l 10;M-1 to V-109; M-l to R-108; M-l to D-107; M-l to V-106; M-l toT-105; M-l to 
C-104;M-1 toN-103;M-l toS-102;M-l toF-101;M-l toF-100; M-l toQ-99;M-l to 
R-98; M-l to H-97; M-l toI-96; M-l to G-95; M-l to T-94; M-l to 1-93; M-l to F-92; 
M-l toG-91; M-l to Q-90; M-l to A-89: M-l to L-88; M-l to P-87; M-l to N-86; M-l to 
P-85;M-1 to W-84; M-l to Y-83; M-l toC-82;M-l toG-81;M-l toV-80; M-l toV-79; 
M-l to N-78; M-l toA-77; M-l to E-76; M-l to M-75; M-l to E-74; M-l to T-73; M-lto 
C-72;M-1 to N-71;M-I toT-70; M-l toF-69: M-l toS-68;M-l toE-67;M-l to Y-66: 
M-l to Y-65; M-l to V-64; M-l to I-63;M-1 to F-62; M-l to E-61; M-l to S-60: M-l to 
L-59; M-l to N-58; M-l to C-57; M-l to W-56; M-l to K-55; M-l to W-54: M-l toV- 
53; M-l toD-52; M-l to V-51;M-1 to K-50;M-1 to G-49; M-lto M-48; M-l toM-47; 
M-l to D-46; M-l to A-45; M-l to F-44;M-1 to A-43; M-l to K-42; M-l to G-41; M-l to 
C-40; M-l toL-39; M-l to P-38: M-l to L-37; M-l to R-36; M-l to E-35; M-l toL-34; 
M-l to M-33; M-l to G-32; M-l to T-31; M-l to E-30; M-lto N-29; M-l to C-28; M-l 
to G-27; M-l to G-26; M- 1 to A-25;M-1 to R-24; M-l to P-23; M-l to C-22; M-l to G- 
21; M-l toG-20; M-l to C-19; M-l to L-18; M-l to L-17; M-l to L-16; M-lto L-15; M-l 
to L-14; M-l to P-13; M-l to L-12; M-l to L-l 1; M-lto Q-10; M-l to P-9; M-l to R-8; 
M-l to R-7; of SEQ ID NO:31. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:3 1 which have been determined 
from the following related cDNA genes: HLHCH17RA (SEQ ID NO:76), HTOAT51R 
(SEQ ID NO:77), and/or HBNB041R (SEQ ID NO:78). 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 122 - 138 of the amino acid sequence referenced in 
Table XIII for this gene. Moreover, a cytoplasmic tail encompassing amino acids 139 to 
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149 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type la 
membrane proteins. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, gastrointestinal and neurodegenerative diseases and disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the central nervous and 
gastrointestinal systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. brain, CNS, gastrointestinal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovia] fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 31 as residues: Ala-5 to Gln-10, Pro-23 to Cys-28, Arg-140 to 
Asp-145. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution and homology to RAMP3 suggest that the translation 
product of this gene is useful for the detection/treatment of neurodegenerative disease 
states and behavioural disorders such as Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder, panic disorder, learning disabilities, ALS, psychoses, 
autism, and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
perception. In addition, the gene or gene product may also play a role in the treatment 
and/or detection of developmental disorders associated with the developing embryo. 
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Alternatively, the tissue distribution in small intestine and colon tissues indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of disorders involving the small intestine. This may include 
diseases associated with digestion and food absorption, as well as hematopoietic 
5 disorders involving the Peyer's patches of the small intestine, or other hematopoietic cells 
and tissues within the body. Similarly, expression of this gene product in colon tissue 
indicates again involvement in digestion, processing, and elimination of food, as well as 
a potential role for this gene as a diagnostic marker or causative agent in the development 
of colon cancer, and cancer in general. Protein, as well as ; antibodies directed against the 
10 protein may show utility as a tumor marker and/or immunotherapy targets for the above 
listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 13 and may have been publicly available prior to conception of the present 

15 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1325 of SEQ ID NO: 13, b is an integer of 15 to 

20 1339, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 13, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

25 

The translation product of this gene shares sequence homology with a 
proteoglycan from Gallus gallus, and this proteoglycan is believed to participate in the 
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osteogenic processes of cartilage ossification (See Genbank Accession No. gi|222847). 
Based on the sequence similarity. The translation product of this gene is expected to 
share biological activities with the Gallus gallus proteoglycan polypeptide. 

Figures 10A-B shows the nucleotide (SEQ ID NO: 14) and deduced amino acid 
sequence (SEQ ID NO:32) of the retinal specific protein. Predicted amino acids from 
about 1 to about 21 constitute the predicted signal peptide (amino acid residues from 
about 1 to about 21 in SEQ ID NO:32) and are represented by the underlined amino acid 
regions. 

Figure 1 1 shows the regions of similarity between the amino acid sequences of 
the retinal specific protein SEQ ID NO:32, and the Gallus gallus proteoglycan (SEQ ID 
NO:79). 

Figure 12 shows an analysis of the amino acid sequence of SEQ ID NO: 32. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity: 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that this gene is expressed in adrenal cortex and 
adrenal medulla tissues. This gene is also expressed in retinal tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 10A-B (SEQ ID NO:32), which was determined by sequencing a cloned cDNA 
(HARA044). The nucleotide sequence shown in Figures 10A-B (SEQ ID NO: 14) was 
obtained by sequencing a cloned cDNA (HARA044), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
using the Sall/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
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NO: 14 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
5 invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 14. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 14. In this context "about" includes the particularly recited size, 
10 larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. 

Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 

15 about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 551 to about 
600, from about 601 to about 650, from about 651 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 850, from about 85 1 to about 

20 900, from about 901 to about 950, from about 951 to about 1000, from about 1001 to 
about 1050, from about 1051 to about 1 100, from about 1 101 to about 1 150, from about 
1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, 
from about 1301 to about 1350, from about 1351 to about 1389, and from about 187 to 
about 1 1 19 of SEQ ID NO: 14, or the complementary strand thereto, or the cDNA 

25 contained in the deposited gene. In this context "about" includes the particularly recited 
ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at 
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both termini. In additional embodiments, the polynucleotides of the invention encode 
functional attributes of the corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 12 and/or Table IV, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table IV can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 12, but may, as 
shown in Table IV, be represented or identified by using tabular representations of the 
data presented in Figure 12. The DNA*STAR computer algorithm used to generate 
Figure 12 (set on the original default parameters) was used to present the data in Figure 
12 in a tabular format (See Table IV). The tabular format of the data in Figure 12 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 12 and in Table IV include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 10A-B. As set out in Figure 12 and in Table IV, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
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Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp- Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 10A-B, up to the proline residue at position number 327 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues nl-332 of Figures 10A-B, where nl is an 
integer from 2 to 327 corresponding to the position of the amino acid residue in Figures 
10A-B (which is identical to the sequence shown as SEQ ID NO:32). N-terminal 
deletions of the polypeptide of the invention shown as SEQ ID NO:32 include 
polypeptides comprising the amino acid sequence of residues: R-2 to T-332; L-3 to T- 
332; L-4 to T-332; A-5 to T-332;F-6 to T-332; L-7 to T-332; S-8 to T-332; L-9 to T-332; 
L-10 to T-332; A-1 1 to T-332; L-12 to T-332; V-13 to T-332;L-14 to T-332; Q-15 to T- 
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332; E-16 to T-332; T-17 to T-332; G-18 to T-332; T-19 to T-332; A-20 to T-332; S-21 
toT-332; L-22 to T-332; P-23 to T-332; R-24 to T-332; K-25 to T-332; E-26 to T-332; R- 
27 to T-332; K-28 to T-332;R-29 to T-332; R-30 to T-332; E-31 to T-332; E-32 to T- 
332; Q-33 to T-332; M-34 to T-332; P-35 to T-332; R-36 toT-332; E-37 to T-332; G-38 
5 to T-332; D-39 to T-332; S-40 to T-332; F-41 to T-332; E-42 to T-332; V-43 to T-332;L- 
44 to T-332; P-45 to T-332; L-46 to T-332; R-47 to T-332; N-48 to T-332; D-49 to T- 
332; V-50 to T-332; L-51 toT-332; N-52 to T-332; P-53 to T-332; D-54 to T-332; N-55 
to T-332; Y-56 to T-332; G-57 to T-332; E-58 to T-332;V-59 to T-332; 1-60 to T-332; D- 
61 to T-332; L-62 to T-332; S-63 to T-332; N-64 to T-332; Y-65 to T-332; E-66 toT- 

10 332; E-67 to T-332; L-68 to T-332; T-69 to T-332; D-70 to T-332; Y-71 to T-332; G-72 
to T-332: D-73 to T-332;Q-74 to T-332; L-75 to T-332; P-76 to T-332; E-77 to T-332; 
V-78 to T-332; K-79 to T-332; V-80 to T-332; T-81 toT-332; S-82 to T-332; L-83 to T- 
332; A-84 to T-332; P-85 to T-332; A-86 to T-332; T-87 to T-332; S-88 to T-332;I-89 to 
T-332; S-90 to T-332; P-91 to T-332; A-92 to T-332; K-93 to T-332; S-94 to T-332; T- 

15 95 to T-332; T-96 toT-332; A-97 to T-332; P-98 to T-332; G-99 to T-332; T-100 to T- 
332; P-101 to T-332; S-102 to T-332; S-103 toT-332; N-104 to T-332; P-105 to T-332; 
T-106 to T-332; M-107 to T-332; T-108 to T-332; R-109 to T-332; P-l 10 toT-332; T- 
1 1 1 to T-332; T-112 to T-332; A-l 13 to T-332; G-l 14 to T-332; L-l 15 to T-332; L-l 16 
to T-332; L-l 17 toT-332; S-l 18 to T-332; S-l 19 to T-332; Q-120 to T-332; P-121 to T- 

20 332; N-122 to T-332; H-123 to T-332; G-124 toT-332; L-125 to T-332; P-126 to T-332; 
T-127 to T-332; C-128 to T-332; L-129 to T-332; V-130 to T-332; C-131 toT-332; V- 
132 to T-332; C-133 to T-332; L-134 to T-332; G-135 to T-332; S-136 to T-332; S-137 
to T-332; V-138 toT-332; Y-139 to T-332; C-140 to T-332; D-141 to T-332; D-142 toT- 
332; 1-143 to T-332; D-144 to T-332; L-145 toT-332; E-146 to T-332; D-147 to T-332; 

25 1-148 to T-332; P-149 to T-332; P-150 to T-332; L-151 to T-332; P-152 toT-332; R-153 
to T-332; R-154 to T-332; T-155 to T-332; A-156 to T-332; Y-157 to T-332; L-158 to T- 
332; Y-159 toT-332; A-160 to T-332; R-161 to T-332; F-162 to T-332; N-163 to T-332; 
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R-164 to T-332; 1-165 to T-332; S-166 toT-332; R-167 to T-332; 1-168 to T-332; R-169 
to T-332; A-170 to T-332; E-171 to T-332; D-172 to T-332; F-173 toT-332; K-174 to T- 
332; G-175 to T-332; L-176 to T-332; T-177 to T-332; K-178 to T-332; L-179 to T-332; 
K-180 toT-332; R-181 to T-332; 1-182 to T-332; D-183 to T-332; L-184 to T-332; S-185 
5 to T-332; N-186 to T-332; N-187 toT-332; L-188 to T-332; 1-189 to T-332; S-190 toT- 
332; S-191 to T-332; 1-192 to T-332; D-193 to T-332; N-194 toT-332; D-195 to T-332; 
A-196 to T-332; F-197 to T-332; R-198 to T-332; L-199 to T-332; L-200 to T-332; H- 
201 toT-332; A-202 to T-332; L-203 to T-332; Q-204 to T-332; D-205 to T-332; L-206 
to T-332; 1-207 to T-332; L-208 toT-332; P-209 to T-332; E-210 to T-332; N-21 1 to T- 

10 332; Q-212 to T-332; L-213 to T-332; E-214 to T-332; A-215 toT-332: L-216 to T-332; 
P-217 to T-332; V-218 to T-332; L-219 to T-332; P-220 to T-332; S-221 to T-332: G- 
222 toT-332; 1-223 to T-332; E-224 to T-332; F-225 to T-332; L-226 to T-332: D-227 to 
T-332; V-228 to T-332; R-229 toT-332; L-230 to T-332; N-23 1 to T-332; R-232 to T- 
332; L-233 to T-332; Q-234 to T-332; S-235 to T-332; S-236 toT-332; G-237 to T-332; 

15 1-238 to T-332; Q-239 to T-332; P-240 to T-332; A-241 to T-332; A-242 to T-332; F- 
243 toT-332; R-244 to T-332; A-245 to T-332; M-246 to T-332; E-247 to T-332; K-248 
to T-332; L-249 to T-332; Q-250 toT-332; F-251 to T-332; L-252 to T-332; Y-253 to T- 
332; L-254 to T-332; S-255 to T-332; D-256 to T-332; N-257 toT-332; L-258 to T-332; 
L-259 to T-332; D-260 to T-332; S-261 to T-332; 1-262 to T-332; P-263 to T-332; G-264 

20 toT-332; P-265 to T-332; L-266 to T-332; P-267 to T-332; P-268 to T-332; S-269 to T- 
332; L-270 to T-332; R-271 toT-332; S-272 to T-332; V-273 to T-332; H-274 to T-332; 
L-275 to T-332; Q-276 to T-332; N-277 to T-332; N-278 toT-332; L-279 to T-332; 1-280 
to T-332; E-281 to T-332; T-282 to T-332; M-283 to T-332; Q-284 to T-332; R-285 toT- 
332; D-286 to T-332; V-287 to T-332; F-288 to T-332; C-289 to T-332; D-290 to T-332; 

25 P-291 to T-332; E-292 toT-332; E-293 to T-332; H-294 to T-332; K-295 to T-332; H- 
296 to T-332; T-297 to T-332; R-298 to T-332; R-299 toT-332; Q-300 to T-332; L-301 
to T-332; E-302 to T-332; D-303 to T-332; 1-304 to T-332; R-305 to T-332; L-306 toT- 
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332; D-307 to T-332; G-308 to T-332; N-309 to T-332; P-310 to T-332; 1-311 to T-332; 
N-312 to T-332; L-313 toT-332; S-314 to T-332; L-315 to T-332; F-316 to T-332; P-317 
to T-332; S-318 to T-332; A-319 to T-332; Y-320 toT-332; F-321 to T-332; C-322 to T- 
332; L-323 to T-332; P-324 to T-332; R-325 to T-332; L-326 to T-332; P-327 toT-332; 
5 of SEQ ID NO:32. Polypeptides encoded by these polynucleotides are also encompassed 
by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability 

10 to illicit mitogenic activity, induce differentiation of normal or malignant cells, bind to 
EGF receptors, etc.)), may still be retained. For example the ability to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 

15 C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 

20 Accordingly, the present invention further provides polypeptides having one or 

more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 10A-B, up to the glutamine residue at position number 7, 
and polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-ml of Figures 

25 10A-B, where ml is an integer from 7 to 331 corresponding to the position of the amino 
acid residue in Figures 10A-B. Moreover, the invention provides polynucleotides 
encoding polypeptides comprising, or alternatively consisting of, the amino acid 
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sequence of C-terminal deletions of the polypeptide of the invention shown as SEQ ID 
NO:32 include polypeptides comprising the amino acid sequence of residues: M-l to F- 
331; M-l to R-330; M-l to G-329; M-l to 1-328; M-l to P-327; M-l to L-326; M-l to R- 
325; M-l to P-324; M-l to L-323; M-l to C-322; M-l to F-321; M-lto Y-320; M-l to A- 
319; M-l toS-318; M-l toP-317;M-l toF-316; M-l toL-315;M-l toS-314; M-l toL- 
313;M-lto N-312; M-l toI-311;M-l toP-310; M-l toN-309; M-l toG-308;M-l toD- 
307; M-l to L-306; M-l to R-305; M-lto 1-304: M-l to D-303; M-l to E-302; M-l to L- 
301; M-l to Q-300; M-l to R-299; M-l to R-298; M-l to T-297; M-lto H-296; M-l to 
K-295; M-l to H-294; M-l to E-293; M-l to E-292; M-l to P-291; M-l to D-290: M-l 
to C-289; M-lto F-288; M-l to V-287: M-l to D-286; M-l to R-285; M-l to Q-284: M-l 
to M-283; M-l to T-282; M-l to E-281; M-lto 1-280; M-l to L-279; M-l to N-278; M-l 
to N-277; M-l to Q-276; M-l to L-275; M-l to H-274; M-l to V-273; M-lto S-272; M-l 
to R-271; M-l to L-270; M-l to S-269; M-l to P-268; M-l to P-267; M-l to L-266; M-l 
to P-265; M-lto G-264; M-l to P-263; M-l to 1-262; M-l to S-261; M-l to D-260; M-l 
to L-259; M-l to L-258; M-l to N-257; M-lto D-256; M-l to S-255; M-l to L-254; M-l 
to Y-253; M-l to L-252; M-l to F-251 ; M-l to Q-250; M-l to L-249; M-lto K-248; M-l 
to E-247; M-l to M-246; M-l to A-245; M-l to R-244; M-l to F-243; M-l to A-242; M- 
1 to A-241 ;M-1 to P-240; M-l to Q-239; M-l to 1-238; M-l to G-237; M-l to S-236; M- 
1 to S-235; M-l to Q-234; M-l to L-233:M-1 to R-232; M-l to N-231; M-l to L-230; M- 
1 to R-229; M-l to V-228; M-l to D-227; M-l to L-226; M-l to F-225;M-1 to E-224; M- 
1 to 1-223; M-l to G-222; M-l to S-221; M-l to P-220; M-l to L-219; M-l to V-218; M- 
1 to P-217;M-1 to L-216; M-l to A-215; M-l to E-214; M-l to L-213; M-l to Q-212; M- 
1 to N-21 1; M-l to E-210; M-l to P-209;M-1 to L-208; M-l to 1-207; M-l to L-206; M- 
1 to D-205; M-l to Q-204; M-l to L-203; M-l to A-202; M-l to H-201;M-1 to L-200; 
M-l to L-199; M-l to R-198; M-l to F-197; M-l to A-196; M-l to D-195; M-l to N- 
194; M-l to D-193;M-1 to 1-192; M-l to S-191; M-l to S-190; M-l to 1-189; M-l to L- 
188; M-l to N-187; M-l to N-186; M-l to S-185;M-1 to L-184; M-l to D-183; M-l to I- 
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182; M-l to R-181; M-I to K-180; Ml to L-179; M-l to K-178; M-l toT-177;M-l to 
L-176; M-l to G-175; M-l to K-174; M-l to F-173; M-l to D-172; M-l to E-171; M-l 
to A-170; M-l to R-169;M-1 to 1-168; M-l to R-167; M-l to S-166; M-l to 1-165; M-l 
to R-164; M-l to N-163; M-l to F-162; M-l to R-161;M-1 to A-160; M-l to Y-159; M-l 
to L-158; M-l to Y-157; M-l to A-156; M-l to T-155; M-l to R-154; M-l to R-153;M-1 
toP-152;M-l toL-151:M-l toP-150; M-l toP-149;M-l to 1-148; M-l toD-147; M-l 
to E-146; M-l to L-145;M-1 to D-144; M-l to 1-143: M-l to D-142; M-l to D- 141: M-l 
to C-140; M-l to Y-139; M-l to V-138; M-l to S-137;M-1 to S-136; M-l to G-135: M-l 
to L-134; M-l to C-133; M-l to V-132; M-l to C-131; M-l to V-130; M-l to L-129;M-1 
to C-128; M-l to T-127; M-l to P-126; M-l to L-125; M-l to G-124; M-l to H-I23: M-l 
to N-I22; M-l to P-121;M-1 to Q-120; M-l to S-l 19; M-l to S-118; M-l to L-l 17: M-l 
to L-l 16; M-l to L-l 15; M-l to G-l 14; M-l to A-l 13;M-1 to T-l 12; M-l to T-l 1 1; M-l 
to P-110; M-l toR-109;M-l toT-108;M-l toM-107; M-l toT-106; M-l toP-105;M-l 
to N-104; M-l to S-103; M-l to S-102; M-l to P-101 ; M-l to T-100; M-l to G-99: M-l 
to P-98; M-l to A-97;M-1 to T-96; M-l to T-95; M-l to S-94; M-l to K-93; M-l to A- 
92; M-l to P-91; M-l to S-90; M-l to 1-89; M-l toS-88: M-l to T-87; M-l to A-86; M-l 
to P-85: M-l to A-84; M-l to L-83; M-l to S-82; M-l to T-81; M-l to V-80; M-lto K- 
79; M-l to V-78; M-l to E-77; M-l to P-76; M-l to L-75; M-l to Q-74; M-l to D-73; 
M-l to G-72; M-l to Y-71;M-1 toD-70; M-l to T-69; M-l to L-68; M-l to E-67; M-l to 
E-66; M-l to Y-65; M-l to N-64; M-l to S-63; M-l toL-62; M-l to D-61; M-l to 1-60; 
M-l to V-59; M-l to E-58; M- 1 to G-57; M-l to Y-56; M-l to N-55; M-l to D-54:M-1 
to P-53; M-l to N-52; M-l to L-51; M-l to V-50; M-l to D-49; M-l to N-48; M-l to R- 
47; M-l to L-46; M-l toP-45; M-l to L-44; M-l to V-43; M-l to E-42; M-l to F-41; M-l 
to S-40; M-l to D-39; M-l to G-38; M-l to E-37; M-lto R-36; M-l to P-35; M-l to M- 
34; M-l to Q-33; M-l to E-32; M-l to E-31; M-l to R-30; M-l to R-29; M-l to K-28;M- 
1 to R-27; M-l to E-26; M-l to K-25; M-l to R-24; M-l to P-23; M-l to L-22; M-l to S- 
21; M-l toA-20; M-l toT- 19; M-l to G-l 8; M-l toT-17; M-l toE-16;M-l toQ-15;M- 
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1 to L-14; M-l to V-13; M-l to L-12; M-l to A-l 1;M-1 to L-10; M-l to L-9; M-l to S-8; 
M-l to L-7; of SEQ ID NO:32. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 14 which have been determined 
from the following related cDNA genes: HARAY79R (SEQ ID NO:80), HARA044R 
(SEQ ID NO:81), HARAJ74R (SEQ ID NO:82), HARA066R (SEQ ID NO:83), 
HARAN19R (SEQ ID NO:84), and HARAT78R (SEQ ID NO:85). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, retinal disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the retina, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. retinal, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 32 as residues: Leu-22 to Asp-39, Asn-64 to Pro-76, Pro-98 to 
Thr-1 1 1, Pro-291 to Glu-302. Polynucleotides encoding said polypeptides are also 
provided. 

The tissue distribution in retinal tissue, and the homology to a Gallus gallus 
proteoglycan involved in the ossification process indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment of disorders of the 
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retina which involve the adhesion of tissues, or the binding of certain proteins to the cell 
surface. The translation products of this gene are useful for the treatment of retinal 
disorders such as retinal detachment in individuals suffering from myopia, or in the 
treatment of macular degeneration. Furthermore, this gene may serve as a tumor marker 
5 for retinoblastomas, or related tumors. More generally, the tissue distribution in retinal 
tissue indicates that The translation product of this gene is useful for the diagnosis, 
detection and/or treatment of eye disorders including blindness, color blindness, impaired 
vision, short and long sightedness, retinitis pigmentosa, retinitis proliferans, and 
retinoblastoma, retinochoroiditis, retinopathy and retinoschisis. Based upon the tissue 

10 distribution of this protein, antagonists directed against this protein is useful in blocking 
the activity of this protein. Accordingly, preferred are antibodies which specifically bind 
a portion of the translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 

15 product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 

20 above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 

25 against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 14 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1375 of SEQ ID NO: 14, b is an integer of 15 to 
1389, where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO: 14, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

15 The translation product of this gene shares sequence homology with the CD33 

protein (See Genbank Accession No. gi|2913995). The expression pattern of CD33 
within the hematopoietic system indicates a potential role in the regulation of myeloid 
cell differentiation. However, this expression is absent from hematopoietic stem cells. 
CD33 is expressed in clonogenic leukemia cells in about 90% of patients suffering from 

20 acute myeloid leukemia (AML). While about 60-70% of adults suffering from AML 
experience complete remission due to chemotherapy application, most of these patients 
will ultimately die of relapsed leukemia. It is believed that, like CD33, the CD33-like 
protein of the present invention is also expressed by clonogenic leukemia cells from the 
vast majority of patients with AML. Thus, there is a clear need to identify and isolate 

25 nucleic acid molecules encoding additional polypeptides having 

CD33-like protein activity. It is believed that cancerous tissue contains significantly 
greater amounts of CD33-like protein gene copy number and expresses significantly 
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enhanced levels of CD33-like protein and mRNA encoding the CD33-like protein when 
compared to a "standard" mammal, i.e. -a mammal of the same species not having the 
cancer or inflammatory disease. Thus, enhanced levels of the CD33-like protein will be 
detected in certain bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal 
fluid) from mammals when compared to sera from mammals of the same species not 
having the cancer or inflammatory disease. 

Two related cDNA genes, HDPIB36 and HEOMH10, have been isolated. These 
cDNA genes appear to encode splice variants of this gene. Preferred polynucleotides 
comprise the following sequences: 

CGACCCACGCGTCCGCCGCCTTCGGCTTCCCCTTCTGCCAA 

GAGCCCTGAGCCACTCACAGCACGACCAGAGA (SEQ ID NO: 86), 

GTATGGAATGGGGTGGGAACCCCTGCCTCTCACACTGGGGAGGGACCCTGGG 

GACAGCCTATGGGCTGAGCAGAGAGGGCTCTCAGGGACCCCTGCAGCACAA 

GAATCTCCCACCACGGTCTCTGTCCCAGCCCTGACTCAGAAGCCTGATGTCTA 

CATCCCCGAGACCCTGGAGCCCGGGCAGCCGGTGACGGTCATCTGTGTGTTT 

AACTGGGCCTTTGAGGAATGTCCACCCCCTTCTTTCTCCTGGACGGGGGCTGC 

CCTCTCCTCCCAAGGAACCAAACCAACGACCTCCCACTTCTCAG (SEQ ID NO: 

87), 

ATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAAGCAAACAGGACAGGTA 
GGAAAGGGGACAGAGGAGCCAAGGCCTCTCAGTGCCGAATTGGGGGCCCAG 
GAGTCTGGAGGGTCCCCACGCAGGAGGGTCCCTGAGCCCTGAGCTGCTCATC 
GATTCTGCCTCTTCCTTCCCT (SEQ ID NO: 88), 

GTGAGTGGGGGAAAGGGGACACCTGGGTCCCAGGAAGGGGACCCTGCTGAG 
TCCTGTCCTCCCTCCCCTCAG (SEQ ID NO: 89), 

CTGGCCCCCTGGCTCAGAAGCGGAATCAGAAAGCCACACCAAACAGTCCTCG 
GACCCCTCTTCCACCAGGTGCTCCCTCCCCAGAATCAAAGAAGAACCAGAAA 
AAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGCCC 
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CAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCC 
AGGCGTCAGACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGA 
TTATGCAGAAGTCAAGTTCCAATGAGGGTCTCTTAGGCTTTAGGACTGGGAC 

TTCGGCTAGGGAGGAAGGTAGAGTAAGAGGTTGAAGATAACAGAGTGCAAA 

GmCCTTCTCTCCCTCTCTCTCT^ 

TTT (SEQ ID NO: 90), and/or 

AAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATCCCAGCACTT 

TGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCC 

TGGCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGG 

CATGGTGGCAGGCGCCTGTAATCCTACTACTTGGGAAGCTGAGGCAGGAGAA 

TCACTTGAACCTGGGAGACGGAGGTTGCAGTGAGCCAAGATCACACCATTGC 

ACGCCAGCTTGGGCAACAAAGCGAGACTCCATCTCAAAAAAAAAATCCTCC 

AAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTAAGGTGGGTGG 

ATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 

ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAG 

TCCCAGCTGTCAGACATTTAAACCAGAGCAACTCCCATCTGGAATGGGAGCT 

GAATAAAATGAGGCTGAGACCTACTGGGCTGCCATTCTCAGACAGTGGAGGC 

CATTCTAAGTCACAGGATGAGACAGGAGGTCCGTACAAGATACAGGTCATA 

AAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAAATCCCACC 

AAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTC 

CTGACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACC 

CGATATGTCCCAAAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAA 

TAAGCAAGAAATAACCATAAAAGTGGGCAACCAGCAGCTCTAGGCGCTGCT 

CTTGTCTATGGAGTAGCCATTCTTTTGTTCCTITACTTTCTTAATAAACTTGCT 

TTCACCTTAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO:91). Also preferred 

are the polypeptides encoded by these polynucleotides. 
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Figures 13A-C shows the nucleotide (SEQ ID NO: 15) and deduced amino acid 
sequence (SEQ ID NO:33) of the CD33-like protein. Predicted amino acids from about 1 
to about 16 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 16 in SEQ ID NO:33) and are represented by the underlined amino acid regions; 
5 and amino acids from about 496 to about 5 12 constitute the predicted transmembrane 
domain (amino acid residues from about 496 to about 512 in SEQ ID NO:33) and are 
represented by the double-underlined amino acid regions. 

Figure 14 shows the regions of similarity between the amino acid sequences of 
the CD33-like protein SEQ ID NO:33, and the CD33L1 protein (gi|88178) (SEQ ID NO: 
10 92). 

Figure 15 shows an analysis of the amino acid sequence of SEQ ID NO:33. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that this gene is expressed highest in spleen tissue and 
15 peripheral blood leukocytes, and to a lesser extent in ovary and lung tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 13A-C (SEQ ID NO:33), which was determined by sequencing a cloned cDNA 
(HDPCL05). The nucleotide sequence shown in Figures 13A-C (SEQ ID NO: 15) was 
20 obtained by sequencing a cloned cDNA (HDPCL05), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
using the Sall/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
25 molecules described herein. By a fragment of an isolated DNA molecule having the 

nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 15 is intended DNA fragments at least about 15nt, and more preferably at least about 
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20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
5 of the deposited cDNA or as shown in SEQ ID NO: 15. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 15. In this context "about" includes the particularl y recited size, 
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to S-387; M-l to Y-386;M-1 to H-385; M-l to V-384; M-l to S-383; M-l to L-382; M-l 
toS-381; M-l to L-380; M-l to S-379; M-l to V-378; M-l to H-377;M-1 to Q-376; M-l 
to S-375; M-l to G-374; M-l to L-373; M-l toP-372; M-l to H-371; M-l to R-370; M-l 
to A-369; M-l to H-368;M-1 to C-367; M-l to T-366; M-l to F-365; M-l to E-364; M-l 
5 toG-363; M-l to E-362; M-l to H-361; M-l to E-360; M-l to V-359;M-1 to Q-358; M-l 
to V-357; M-l to R-356; M-l to P-355; M-l toL-354; M-l to E-353; M-l to L-352; M-l 
to V-351; M-l to G-350;M-1 to P-349; M-l to D-348; M-l to S-347; M-l to P-346; M-l 
toQ-345; M-l to S-344; M-l to P-343; M-l to S-342; M-l to L-341;M-1 to V-340; M-l 
to Q-339; M-l to G-338; M-l to R-337; M-l toQ-336; M-l to T-335; M-l to W-334; M- 
10 1 to S-333; M-l to L-332;M-1 to R-331; M-l to A-330; M-l to P-329; M-l to P-328; M- 
1 toS-327; M-l to S-326; M-l to H-325; M-l to T-324; M-l to V-323;M-1 to C-322; M- 
1 toV-321;M-l toL-320;M-l toC-319; M-l toL-318;M-l toS-317;M-l toQ-316; M- 
1 toG-315;M-l toE-314;M-l toL-313;M-l to V-312; M-l toP-311;M-l toL-310; M- 
1 toS-309; M-l to T-308; M-l to G-307; M-l to N-306; M-l to G-305;M-1 to L-304; M- 
15 1 to N-303; M-l to E-302; M-l to L-301; M-l toV-300; M-l to T-299; M-l to R-298; 
M-l to N-297; M-l to A-296;M-1 to Q-295; M-l to S-294; M-l to V-293; M-l to M- 
292; M-l toV-291; M-l to R-290; M-l to L-289; M-l to N-288; M-l to E-287;M-1 to P- 
286; M-l to P-285; M-l to Y-284; M-l to Q-283; M-l toV-282; M-l to S-281; M-l to L- 
280; M-l to D-279; M-l to L-278;M-1 to A-277; M-l to R-276; M-l to Q-275; M-l to 
20 Q-274; M-l toS-273; M-l to G-272; M-l to L-271; M-l to R-270; M-l to N-269;M-1 to 
E-268; M-l to A-267; M-l to R-266; M-l to C-265; M-l toT-264; M-l to Y-263; M-l to 
R-262; M-l to G-261; M-l to S-260;M-1 to D-259; M-l to G-258; M-l to A-257; M-l to 
K-256; M-l toV-255; M-l to G-254; M-l to P-253; M-l toL-252; M-l toE-251;M-l to 
L-250; M-l to G-249; M-l to L-248; M-l to P-247; M-l toR-246; M-l to P-245; M-l to 
25 G-244; M-l to W-243; M-l to P-242;M-1 to H-241; M-l to S-240; M-l to S-239; M-l to 
S-238; M-l toL-237; M-l to V-236; M-l to R-235; M-l to N-234; M-l to Q-233;M-1 to 
L-232; M-l to V-231; M-l to W-230; M-l to S-229; M-l toL-228; M-I toT-227; M-l to 
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A-226; M-J to P-225; M-l to P-224;M-1 to Q-223; M- 1 to S-222; M-l to D-221 ; M-l to 
A-220; M-l toA-219; M-l to C-218; M-l to L-217; M-l to L-216; M-l to R-215;M-1 to 
L-214; M-l to F-213; M-l to Q-212; M-l to G-21 1; M-l toK-210; M-l to Q-209; M-l to 
A-208; M-l to E-207; M-l to L-206;M-1 to Y-205; M-l to P-204; M-l to V-203; M-l to 
5 N-202; M-l toG-201; M-l to Q-200; M-l to P-199; M-l to Q-198; M-l to P-197;M-1 to 
E-196; M-l to L-195; M-l to A-194; M-l to P-193; M-l toT-192; M-l to N-191; M-l to 
D-190; M-l toR-189; M-l toS-188;M-l to 1-187; M-l toS-186; M-l to 1-185; M-l to 
V-184; M-l toL-183; M-l to D-182; M-l to R-181; M-l to P-180; M-l to A-179;M-1 to 
Y-178; M-l to A-177; M-l to V-176; M-l to R-175; M-l toL-174; M-l to R-l-73; M-l to 

10 V-172;M-1 toT-171;M-ltoR-170;M-l toQ-169;M-l to A-168;M-1 toS-167;M-l to 
V-166.M-1 toG-165; M-l toK-164; M-l toR-163;M-l toS-162; M-l to F-161;M-1 to 
D-160; M-l to V-159; M-l toH-158; M-l to C-157; M-l toT-156; M-l toL-155; M-l to 
D-154; M-l toT-153;M-l toN-152;M-l to H-151; M-l to D-150; M-l toQ-149;M-l 
to P-148; M-l toR-147; M-l to P-146; M-l to T-145; M-l to F-144; M-l to S-143;M-1 

15 to L-142; M-l to V-141; M-l to T-140; M-l to V-139; M-l toK-138; M-l to L-137; M-l 
toF-136;M-l toF-135;M-l toG-134;M-l toD-133; M-l to N-132; M-l toM-131;M-l 
to F-130; M-l toN-129; M-l to Y-128; M-l to R-127; M-l to V-126; M-l to Y-125;M-1 
to S-124; M-l to G-123; M-l to R-122; M-l to E-121; M-l toV-120; M-l to R-119; M-l 
to F-118; M-l to F-l 17; M-l to Y-116;M-1 to Q-l 15; M-l to S-114; M-l to E-113; M-l 

20 to D-l 12; M-l toQ-1 1 1; M-l to M-l 10; M-l to Q-109; M-l to A-108; M-l to D-107;M- 
1 to R-106; M-l to 1-105; M-l to V-104; M-l to L-103; M-l toS-102; M-l to C-101; M- 
1 to N-100; M-l to G-99; M-l to K-98;M-1 to A-97; M-l to P-96; M-l to D-95; M-l to 
G-94; M-l toT-93; M-l to L-92; M-l to Q-91; M-l to F-90; M-l to R-89; M-l toG-88; 
M-l to R-87; M-l to T-86; M-l to S-85; M-l to M-84; M-lto E-83; M-l to V-82; M-l to 

25 E-81; M-l to R-80; M-l to S-79; M-lto Q-78; M-l to H-77; M-l to N-76; M-l to T-75; 
M-l to A-74;M-1 to V-73; M-l to P-72; M-l to A-71; M-l to G-70; M-l toK-69; M-l to 
T-68; M-l to T-67; M-l to E-66; M-l to T-65; M-l toV-64; M-l to A-63; M-l to K-62; 
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M-l to F-61; M-l to W-60; M-lto Y-59; M-l to G-58; M-l to Y-57; M-l to A-56; M-l 
to P-55;M-1 to T-54; M-l to S-53; M-l to G-52; M-I to T-51; M-l toW-50; M-l to D- 
49; M-l to Q-48; M-l to R-47; M-l to P-46; M-lto Y-45; M-l to S-44; M-l to F-43; M- 
1 to S-42; M-l to C-41; M-lto P-40; M-l to V-39; M-l to S-38; M-l to 1-37; M-l to D- 
5 36; M-lto C-35; M-l to A-34; M-l to E-33; M-l to P-32; M-l to V-31;M-1 to M-30; M- 
1 to V-29; M-l to S-28; M-l to E-27; M-l toQ-26; M-l to V-25; M-l to R-24; M-l to I- 
23; M-l to W-22; M-lto F-21; M-l to R-20; M-l to G-19; M-l to D-18; M-l to M- 
17;M-1 to A-16;M-1 to Q-15; M-l to S-14;M-1 toG-13;M-l toG-12:M-l to L-11;M- 
1 to L-10; M-l to S-9; M-l to S-8; M-l toL-7; of SEQ ID NO:33. Polypeptides encoded 
10 by these polynucleotides are also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 15 which have been determined 
from the following related cDNA genes: HTOFA26R (SEQ ID NO:93), HWAEM43R 
(SEQ ID NO:94), HDPMQ69R (SEQ ID NO:95), HDPGA09RA (SEQ ID NO:96), 
15 HEOMH10R (SEQ ID NO:97), and HFKCT73F (SEQ ID NO:98). 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 496 - 512 of the amino acid sequence referenced in 
Table XIII for this gene. Moreover, a cytoplasmic tail encompassing amino acids 513 to 
639 of this protein has also been determined. Based upon these characteristics, it is 
20 believed that the protein product of this gene shares structural features to type la 
membrane proteins. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of the following diseases and conditions which 
25 include, but are not limited to, disorders of the immune system, in particular the 

immunodiagnosis of acute leukemias. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful to provide immunological probes for differential 
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identification of the tissue(s) or ceil type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is detected in certain tissues or cell types (e.g., 
immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 33 
as residues: Pro-46 to Gly-52, Asn-76 to Val-82, Ser-85 to Phe-90, Gly-94 to Asn-100, 
Gin- 1 1 1 to Tyr-1 16, Pro- 146 to Leu- 155, Ser-188 to Asn-202, Ser-240 to Arg-246, Gly- 
258 to Tyr-263, Ala-267 to Arg-276, Ser-326 to Arg-331, Ser-333 to Gln-339, Pro-343 
to Asp-348, Glu-426 to Asp-432, Pro-517 to His-533, AIa-550 to Pro-565, Gly-569 to 
Gln-582, Pro-589 to GIu-606, Gly-616 to Ala-623, Met-625 to Ala-631. 

CD33 monoclonal antibodies (MoAB) are important in the immunodiagnosis of 
AML. CD33 MoABs have been used in preliminary therapeutic trials for purging bone 
marrow of AML patients, either before transplantation or for diseases resistant to 
chemotherapy. To prevent AML patients in remission from suffering relapse, or due to 
the lack of an appropriate allogenic bone marrow donor, a method is necessary for 
purging leukemia cells from the autografts of patients with advanced AML. By the 
invention, this method is provided by which bone marrow from an AML patient is 
obtained by, for example, percutaneous aspirations from the posterior iliac crest, 
isolating bone marrow mononuclear by Ficoll-hypaque density gradient centrifugation, 
and incubating with an anti-CD33-like protein MoAB, for example, 3-5 times for 15-30 
min. at 4-6 degrees C, followed by incubation with rabbit complement at about 37 
degrees C for 30 minutes. The patient is then subject to myeloablative chemotherapy, 
followed by reinfusion of the treated autologous bone marrow according to standard 
techniques. By the invention, clonogenic tumor cells are depleted from the bone marrow 
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while sparing hematopoietic cells necessary for engraftment. In a further 
embodimenuhe invention provides an in vivo method for selectively killing or inhibiting 
growth of tumor cells expressing CD33-like protein antigen of the present invention. The 
method involves administering to the patient an effective amount of an antagonist to 
inhibit the CD33-like protein receptor signaling pathway. By the invention, 
administering such antagonist of the CD33-like protein to a patient may also be useful 
for treating inflammatory diseases including arthritis and colitis. Antagonists for use in 
the present invention include polyclonal and monoclonal antibodies raised aginst the 
CD33-like protein or a fragment thereof, antisense molecules which control gene 
expression through antisense DNA or RNA or through triple-helix formation, proteins or 
other compounds which bind the CD33-like protein domains, or soluble forms of the 
CD33-like protein, such as protein fragments including the extracellular region from the 
full length receptor, which antagonize CD33-like protein mediated signaling by 
competing with the cell surface CD33-like protein for binding to CD33 receptor ligands. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 15 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 2281 of SEQ ID N0.15, b is an integer of 15 to 
2295, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 15, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This invention relates to newly identified polynucleotides, polypeptides encoded 
5 by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. The polypeptide of the present 
invention has been putatively identified as a CD33 homolog derived from a human 
primary dendritic cells cDNA library. More particularly, the polypeptide of the present 
invention has been putatively identified as a human siglec homolog, sometimes hereafter 
10 referred to as "CD33-like 3" and/or "siglec 7". The invention also relates to inhibiting 
the action of such polypeptides. 

The siglecs (sialic acid binding Ig-like lectins) are type 1 membrane proteins that 
constitute a distinct subset of the Ig superfamily, characterised by their sequence 
similarities and abilities to bind sialic acids in glycoproteins and glycolipid (Crocker, 
15 P.R., et ah, Glycobiology:8 (1998)). Members of the Ig Superfamily of proteins are 
defined as molecules that share domains of sequence similarity with the variable or 
constant domains of antibodies. 

Many Ig superfamily proteins consist of multiple tandem Ig-like domains 
connected to other domains, such as Fn-III repeat domains (Vaughn, D.E., and P.J. 
20 Bjorkman, Neuron, 16:261-73 (1996)). At the primary structural level, traditional Ig-like 
domains can be identified by the presence of two cysteine residues separated by 
approximately 55-75 amino acid residues, and an "invariant" tryptophan residue located 
10-15 residues C-terminal to the first of the two conserved cysteine residues. The two 
conserved cysteine residues are thought to be involved in disulfide bonding to form the 
25 folded Ig structures (Vaughn, D.E., (1996)). 

Ig-like domains further share a common folding pattern, that of a sandwich or 
fold structure of two b-sheets consisting of antiparallel b-strands containing 5-10 amino 
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acids (Huang, Z., et al., Biopolymers, 43:367-82 (1997)). Ig-like domains are divided, 
based upon sequence and structural similarities, into four classifications known as CI, 
C2, 1 and V-Iike domains. 

The functional determinants of the Ig-like domains are presented on the faces of 
b-sheets or the loop regions of the Ig-fold. Accordingly, protein-protein interactions can 
occur either between the faces of the b-sheets, or the loop regions of the Ig-fold (Huang, 
Z., (1998)). These Ig-like domains are involved in mediating a diversity of biological 
functions such as intermolecular binding and protein-protein homophilic or heterophilic 
interactions. 

Thus, Ig-like domains play an integral role in facilitating the activities of proteins 
of the Ig superfamily. In mammals, the group currently comprises sialoadhesin/siglec-1, 
CD22/siglec-2, CD33/siglec-3, myelin associated glycoprotein (MAG/siglec-4), siglecs- 
5, -6 and -7 

(Crocker, P.R., et al., EMBO J., 13:4490-503 (1994); Sgroi, D., et al., J Biol. Chem., 
268:701 1-18 (1993); Freeman, D.S., et al., Blood, 85:2005-12 (1995); Kelm, S., et al., 
Curr Biol., 4:965-72 (1994); Cornish, A.L., et al., Blood, 92:2123-132 (1998); Patel, N., 
et al., J Biol. Chem, 274:22729-738 (1999); Nicol, G., et al., J Biol. Chem, In Press 
(1999)). Siglec-7 has also recently been characterised independently as the NK receptor 
p75/AIRMl (Falco, M M et al., J. Exp. Med., 190:793-802 (1999)). In addition, the gene 
encoding another siglec-like sequence, OBBP-like protein has been reported but there is 
no information on its binding activity (Yousef, G.M., et al., Biochem. Biophys. Res. 
Commun., In Press (1999)). 

Each of these proteins has an extracellular region made up of a membrane distal 
V-set domain followed by varying numbers of C2 set domains which range from 16 in 
sialoadhesin to 1 in CD33. In the cases of sialoadhesin, CD22, MAG and CD33, the 
sialic acid binding site has been mapped to the V-set domain and for sialoadhesin it has 
been further characterised at the molecular level by X-ray crystallography 1 1 (Nath, D., 
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et a!., J Biol. Chem., 270:26184-91 (1995); van der Merwe, P.A., et aL, J, Biol. Chem., 
271:9273-80 (1996); Tang, S., et ah, L Cell Biol., 138:1355-66 (1997); Taylor, V.C., et 
al., J. Biol. Chem., 274:1 1505-12 (1999); May, A.P., et aL, Molecular Cell, 1:719-28 
(1998)). 

Apart from MAG and SMP that are found exclusively in the nervous system, all 
siglecs characterised to date are expressed on discrete subsets of hemopoietic cells and 
can provide useful lineage-restricted markers. Thus, CD22 is present only on mature B 
cells, sialoadhesin is on macrophage subsets, CD33 is a marker of early committed 
myeloid progenitor cells, siglec-5 is expressed by monocytes and mature neutrophils, 
siglec-6 is on B cells and siglec-7 is expressed by NK cells and monocytes (Dorken, B., 
et al., J. Immunology, 136:4470-79 (1986); Crocker, P.R., et al., J. Exp. Med., 164:1862- 
75 (1986); Peiper, S.C., et al., In Leukocyte Typing IV. Oxford University Press, Oxford. 
814-16 (1989); Cornish, A.L., et al., Blood, 92:2123-132 (1998); Patel, N., et al., J Biol. 
Chem, 274:22729-738 (1999); Nicol, G., et al, J Biol. Chem, In Press (1999)). These 
expression patterns indicate discrete functions amongst hemopoietic cell subsets, but 
apart from CD22, a well-characterised negative regulator of B cell activation (reviewed 
in Cyster, J.G. and C.C. Goodnow, Immunity, 6:509-17 (1997)), the biological functions 
of siglecs expressed in the hemopoietic system are unknown. Proposed functions include 
cell-cell interactions through recognition of sialylated glycoconjugates on other cells. 
However, a number of studies have also shown that cell-cell adhesion mediated by 
siglecs can be modulated by cis-interactions with sialic acids present in the host plasma 
membrane. This is particularly striking for CD22, CD33 and siglec-5, whose binding 
activities can be greatly increased if host cells are pretreated with sialidase to remove the 
cis-competing sialic acids (Freeman, D.S., et al., Blood, 85:2005-12 (1995); Cornish, 
A.L., et al., Blood, 92:2123-132 (1998); Sgroi, D., et al., P.N.A.S., 92:4026-30 (1995)). 

Besides potential roles in cellular interactions, there is growing evidence that, 
similar to CD22, the more recently characterised siglecs are involved in signalling 
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functions. The cytoplasmic tails of CD33 and siglecs-5, -6 and -7 have two well- 
conserved tyrosine-based motifs that are similar to well-characterised signaling motifs in 
other leukocyte receptors (Gergely, J., et al., Immun. Lett, 68:3-15 (1999)). Where 
studied, both tyrosine residues can be phosphorylated by src-like kinase(s) and, in the 
case of the membrane proximal tyrosine, this leads to subsequent recruitment of the 
tyrosine phosphatases, SHP-1 and SHP-2 (Falco, M. f et ah, J. Exp. Med., 190:793-802 
(1999); Taylor, V.C., et al., J. Biol. Chem., 274:1 1505-12 (1999)). 
Thus there exists a clear need for identifying and exploiting novel members of the siglec 
family of immunoglobulin proteins. Although structurally related, such proteins may 
possess diverse and multifaceted functions in a variety of cell and tissue types. The 
inventive purified siglec proteins are research tools useful for the identification, 
characterization and purification of cell signaling molecules. Furthermore, the 
identification of new siglecs permits the development of a range of derivatives, agonists 
and antagonists at the nucleic acid and protein levels which in turn have applications in 
the treatment and diagnosis of a range of conditions such as cancer, inflammation, 
neurological disorders and immunological disorders, amongst many other conditions. 
The polypeptide of the present invention has been putatively identified as a member of 
the siglec family and has been termed CD33-like 3. This identification has been made as 
a result of amino acid sequence homology to the human cd33U (See Genbank Accession 
No. gi|29 13995). 

Figures 16A-B show the nucleotide (SEQ ID NO: 16) and deduced amino acid 
sequence (SEQ ID NO:34) of CD33-like 3. Predicted amino acids from about 1 to about 
18 constitute the predicted signal peptide (amino acid residues from about 1 to about 18 
in SEQ ID NO:34) and are represented by the underlined amino acid regions; and amino 
acids from about 360 to about 376 constitute the predicted transmembrane domain 
(amino acids from about 360 to about 376 in SEQ ID NO:34) and are represented by the 
double underlined amino acids. 
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Figure 17 shows the regions of similarity between the amino acid sequences of 
the CD33-Iike 3 protein (SEQ ID NO:34) and the human CD33L1 protein (SEQ ID 
NO:99). 

Figure 18 shows an analysis of the CD33-like 3 amino acid sequence. Alpha, 
5 beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; 
flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human NK cells, T-cells, primary dendritic ceils, placenta, spleen, primary breast 
cancer, gall bladder, apoptotic t-cells, macrophage, and chronic lymphocytic leukemia 
10 spleen. The polynucleotide of this invention was discovered in a human primary 
dendritic cell cDNA library. 

As shown in Figures 16A-B, CD33-like 3 has a transmembrane domain (the 
transmembrane domains comprise amino acids from about 360 to about 376 of SEQ ID 
NO:34; which correspond to amino acids from about 360 to about 376 of Figures 16A-B 
15 ). The polynucleotide contains an open reading frame encoding the CD33-like 3 

polypeptide of 467 amino acids. CD33-like 3 exhibits a high degree of homology at the 
amino acid level to the human CD33L1 (as shown in Figure 18). 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the CD33-like 3 polypeptide having the amino acid sequence 
20 shown in Figures 16A-B (SEQ ID NO:34). The nucleotide sequence shown in Figures 
16A-B (SEQ ID NO: 16) was obtained by sequencing a cloned cDNA (HDPUW68), 
which was deposited on November 17 at the American Type Culture Collection, and 
given Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
25 molecules described herein. By a fragment of an isolated DNA molecule having the 

nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 16 is intended DNA fragments at least about 15nt, and more preferably at least about 
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20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 16. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 16. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of CD33-like 3 polynucleotide fragments of the 
invention include, for example, fragments that comprise, or alternatively, consist of, a 
sequence from about nucleotide 1 to about 50, from about 51 to about 100, from about 
101 to about 150, from about 151 to about 200, from about 201 to about 250, from about 
251 to about 300, from about 301 to about 350, from about 351 to about 400, from about 
401 to about 450, from about 451 to about 500, from about 501 to about 550, from about 
551 to about 600, from about 601 to about 650, from about 651 to about 700, from about 
701 to about 750, from about 751 to about 800, from about 801 to about 850, from about 
851 to about 900, from about 901 to about 950, from about 951 to about 1000, from 
about 1001 to about 1050, from about 1051 to about 1100, from about 1 101 to about 
1 150, from about 1 151 to about 1200, from about 1201 to about 1250, from about 1251 
to about 1300, from about 1301 to about 1350, from about 1351 to about 1400, from 
about 1401 to about 1450, from about 1451 to about 1500, from about 1501 to about 
1550, from about 1551 to about 1600, from about 1601 to about 1650, from about 1651 
to about 1700, from about 1701 to about 1748 of SEQ ED NO: 16, or the complementary 
strand thereto, or the cDNA contained in the deposited gene. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. 
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Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 
alternatively, consisting of, the transmembrane domain (amino acid residues from about 
360 to about 376 in Figures 16A-B (amino acids from about 360 to about 376 in SEQ ID 
5 NO:34). Since the location of these domains have been predicted by computer analysis, 
one of ordinary skill would appreciate that the amino acid residues constituting these 
domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending on the 
criteria used to define each domain. 

In additional embodiments, the polynucleotides of the invention encode functional 

10 attributes of CD33-like 3. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 

15 hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of CD33-Iike 3. The 
data representing the structural or functional attributes of CD33-like 3 set forth in Figure 
18 and/or Table VI, as described above, was generated using the various modules and 
algorithms of the DNA*STAR set on default parameters. In a preferred embodiment, the 

20 data presented in columns VIII, IX, XIII, and XIV of Table VI can be used to determine 
regions of CD33-like 3 which exhibit a high degree of potential for antigenicity. Regions 
of high antigenicity are determined from the data presented in columns VIII, IX, XIII, 
and/or XIV by choosing values which represent regions of the polypeptide which are 
likely to be exposed on the surface of the polypeptide in an environment in which antigen 

25 recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 18 , but may, as 
shown in Table VI, be represented or identified by using tabular representations of the 
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data presented in Figure 18 . The DNA*STAR computer algorithm used to generate 
Figure 18 (set on the original default parameters) was used to present the data in Figure 
18 in a tabular format (See Table VI). The tabular format of the data in Figure 18 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
5 preferred regions set out in Figure 18 and in Table VI include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 16A-B. As set out in Figure 18 and in Table VI, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
10 regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 

15 multimerize, modulate cellular interaction, or signalling pathways, etc.) may still be 
retained. For example, the ability of shortened CD33-iike 3 muteins to induce and/or 
bind to antibodies which recognize the complete or mature forms of the polypeptides 
generally will be retained when less than the majority of the residues of the complete or 
mature polypeptide are removed from the N-terminus. Whether a particular polypeptide 

20 lacking N-terminal residues of a complete polypeptide retains such immunologic 

activities can readily be determined by routine methods described herein and otherwise 
known in the art. It is not unlikely that an CD33-like 3 mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six CD33-Iike 3 amino acid residues 

25 may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the CD33-like 3 amino acid sequence 
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shown in Figures 16A-B , up to the glutamic acid residue at position number 462 and 
polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues nl-467 of Figures 
16A-B , where nl is an integer from 2 to 462 corresponding to the position of the amino 
5 acid residue in Figures 1 6 A-B (which is identical to the sequence shown as SEQ ID 
NO:34). In another embodiment, N-terminal deletions of the CD33-like 3 polypeptide 
can be described by the general formula n2-467, where n2 is a number from 2 to 462. 
corresponding to the position of amino acid identified in Figures 16A-B . N-terminal 
deletions of the CD33-like 3 polypeptide of the invention shown as SEQ ID NO:34 

10 include polypeptides comprising the amino acid sequence of residues: N-terminal 
deletions of the CD33-like 3 polypeptide of the invention shown as SEQ ID NO:34 
include polypeptides comprising the amino acid sequence of residues: L-2 to K-467; L-3 
to K-467; L-4 to K-467: L-5 to K-467: L-6 to K-467; L-7 to K-467; P-8 to K-467; L-9 to 
K-467; L-10 toK-467; W-l 1 to K-467; G-12 to K-467; R-13 to K-467; E-14 to K-467; 

15 R-15 to K-467; V-16 to K-467; E-17 to K-467; G-18 to K-467;Q- 19 to K-467; K-20 to 
K-467; S-21 to K-467; N-22 to K-467; R-23 to K-467; K-24 to K-467; D-25 to K-467; 
Y-26 to K-467; S-27 toK-467; L-28 to K-467; T-29 to K-467; M-30 to K-467; Q-31 to 
K-467; S-32 to K-467; S-33 to K-467; V-34 to K-467; T-35 to K-467;V-36 to K-467; Q- 
37 to K-467; E-38 to K-467; G-39 to K-467; M-40 to K-467; C-41 to K-467; V-42 to K- 

20 467; H-43 to K-467; V-44 toK-467; R-45 to K-467; C-46 to K-467; S-47 to K-467; F-48 
to K-467; S-49 to K-467; Y-50 to K-467; P-51 to K-467; V-52 to K-467;D-53 to K-467; 
S-54 to K-467; Q-55 to K-467; T-56 to K-467; D-57 to K-467; S-58 to K-467; D-59 to 
K-467; P-60 to K-467; V-61 toK-467; H-62 to K-467; G-63 to K-467; Y-64 to K-467; 
W-65 to K-467; F-66 to K-467; R-67 to K-467; A-68 to K-467; G-69 to K-467;N-70 to 

25 K-467; D-71 to K-467; 1-72 to K-467; S-73 to K-467; W-74 to K-467; K-75 to K-467; 
A-76 to K-467; P-77 to K-467; V-78 toK-467; A-79 to K-467; T-80 to K-467; N-81 to 
K-467; N-82 to K-467; P-83 to K-467; A-84 to K-467; W-85 to K-467; A-86 to K- 
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467;V-87 to K-467; Q-88 to K-467; E-89 to K-467; E-90 to K-467; T-91 to K-467; R-92 
to K-467; D-93 to K-467; R-94 to K-467; F-95 toK-467; H-96 to K-467; L-97 to K-467; 
L-98 to K-467; G-99 to K-467; D-100 to K-467; P-101 to K-467; Q-102 to K-467; T-I03 
toK-467; K-104 to K-467; N-105 to K-467; C-106 to K-467; T-107 to K-467; L-108 to 
5 K-467; S-109 to K-467; 1-1 10 to K-467; R-lll toK-467; D-l 12 to K-467; A-l 13 to K- 
467; R-l 14 to K-467; M-l 15 to K-467; S-l 16 to K-467; D-l 17 to K-467; A-l 18 to K- 
467; G-l 19 toK-467; R-120 to K-467; Y-121 to K-467; F-122 to K-467; F-123 to K-467; 
R-124 to K-467; M-125 to K-467; E-126 to K-467; K-127 toK-467; G-128 to K-467; N- 
129 to K-467; 1-130 to K-467; K-131 to K-467; W-132 to K-467; N-133 to K-467; Y- 
10 134 to K-467; K-135to K-467; Y-136 to K-467; D-137 to K-467; Q-138 to K-467; L-139 
to K-467; S-140 to K-467; V-141 to K-467; N-142 to K-467; V-143to K-467; T-144 to 
K-467; A-145 to K-467; L-146 to K-467; T-147 to K-467; H-148 to K-467; R-149 to K- 
467; P-150 to K-467; N-151to K-467; 1-152 to K-467; L-153 to K-467; 1-154 to K-467; 
P-155 to K-467; G-156 to K-467; T-157 to K-467; L-158 to K-467; E-159 toK-467; S- 

15 160 to K-467; G-161 to K-467; C-162 to K-467; F-163 to K-467; Q-164 to K-467; N- 
165 to K-467; L-166 to K-467; T-167 toK-467; C-168 to K-467: S-169 to K-467; V-170 
to K-467; P-171 to K-467; W-172 to K-467; A-173 to K-467; C-174 to K-467; E-175 
toK-467; Q-176 to K-467; G-177 to K-467; T-178 to K-467; P-179 to K-467; P-180 to 
K-467; M-181 to K-467; 1-182 to K-467; S-183 toK-467; W-l 84 to K-467; M-185 to K- 

20 467; G-186 to K-467; T-187 to K-467; S-188 to K-467; V-189 to K-467; S-190 to K- 
467; P-191 toK-467; L-192 to K-467; H-193 to K-467; P-194 to K-467; S-195 to K-467; 
T-196 to K-467; T-197 to K-467; R-198 to K-467; S-199 toK-467; S-200 to K-467; V- 
201 to K-467; L-202 to K-467; T-203 to K-467; L-204 to K-467; 1-205 to K-467; P-206 
to K-467; Q-207 toK-467; P-208 to K-467; Q-209 to K-467; H-210 to K-467; H-211 to 

25 K-467; G-212 to K-467; T-213 to K-467; S-214 to K-467; L-215 toK-467; T-216 to K- 
467; C-217 to K-467; Q-218 to K^67; V-219 to K-467; T-220 to K-467; L-221 to K- 
467; P-222 to K-467; G-223 toK-467; A-224 to K-467; G-225 to K-467; V-226 to K- 
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467; T-227 to K-467; T-228 to K-467; N-229 to K-467; R-230 to K-467; T-231 toK-467; 
1-232 to K-467; Q-233 to K-467; L-234 to K-467; N-235 to K-467; V-236 to K-467; S- 
237 to K-467; Y-238 to K-467; P-239 toK-467; P-240 to K-467; Q-241 to K-467; N-242 
to K-467; L-243 to K-467; T-244 to K-467; V-245 to K-467; T-246 to K-467; V-247 
5 toK-467; F-248 to K-467; Q-249 to K-467; G-250 to K-467; E-251 to K-467; G-252 to 
K-467; T-253 to K-467; A-254 to K-467; S-255 toK-467; T-256 to K-467; A-257 to K- 
467; L-258 to K-467; G-259 to K-467; N-260 to K-467; S-261 to K-467: S-262 to K- 
467; S-263 toK-467; L-264 to K-467; S-265 to K-467; V-266 to K-467; L-267 to K-467; 
E-268 to K-467: G-269 to K-467; Q-270 to K-467; S-271 toK-467; L-272 to K-467; R- 

10 273 to K-467; L-274 to K-467; V-275 to K-467; C-276 to K-467; A-277 to K-467: V- 
278 to K-467; D-279 toK-467; S-280 to K-467; N-281 to K-467; P-282 to K-467; P-283 
to K-467; A-284 to K-467; R-285 to K-467; L-286 to K-467; S-287 toK-467; W-288 to 
K-467; T-289 to K-467; W-290 to K-467; R-291 to K-467; S-292 to K-467; L-293 to K- 
467; T-294 to K-467; L-295 toK-467; Y-296 to K-467; P-297 to K-467: S-298 to K-467; 

15 Q-299 to K-467; P-300 to K-467; S-301 to K-467; N-302 to K-467; P-303 toK^67; L- 
304 to K-467; V-305 to K-467; L-306 to K-467; E-307 to K-467; L-308 to K-467; Q-309 
to K-467; V-310 to K-467; H-31 1 toK-467; L-312 to K-467; G-313 to K-467; D-314 to 
K-467; E-315 to K-467; G-316 to K-467; E-317 to K-467; F-318 to K-467: T-319 toK- 
467; C-320 to K-467; R-321 to K-467; A-322 to K-467; Q-323 to K-467; N-324 to K- 

20 467; S-325 to K-467; L-326 to K-467; G-327 toK-467; S-328 to K-467; Q-329 to K-467; 
H-330 to K-467; V-331 to K-467; S-332 to K-467; L-333 to K-467; N-334 to K-467; L- 
335 toK-467; S-336 to K-467; L-337 to K-467; Q-338 to K-467; Q-339 to K-467; E-340 
to K-467; Y-341 to K-467; T-342 to K-467; G-343 toK-467; K-344 to K-467; M-345 to 
K-467; R-346 to K-467; P-347 to K-467; V-348 to K-467; S-349 to K-467; G-350 to K- 

25 467; V-351 toK-467; L-352 to K-467; L-353 to K-467; G-354 to K-467; A-355 to K- 
467; V-356 to K-467; G-357 to K-467; G-358 to K-467; A-359 toK-467; G-360 to K- 
467; A-361 to K-467; T-362 to K-467; A-363 to K-467; L-364 to K-467; V-365 to K- 
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467; F-366 to K-467; L-367 toK-467; S-368 to K-467; F-369 to K-467; C-370 to K-467; 
V-371 to K-467; 1-372 to K-467; F-373 to K-467; 1-374 to K-467; V-375 toK-467; V- 
376 to K-467; R-377 to K-467; S-378 to K-467; C-379 to K-467; R-380 to K-467; K-381 
to K-467; K-382 to K-467; S-383 toK-467; A-384 to K-467; R-385 to K-467; P-386 to 
K-467; A-387 to K-467; A-388 to K-467; D-389 to K-467; V-390 to K-467; G-391 toK- 
467; D-392 to K-467; 1-393 to K-467; G-394 to K-467; M-395 to K-467; K-396 to K- 
467; D-397 to K-467; A-398 to K-467; N-399 toK-467; T-400 to K-467; 1-401 to K-467; 
R-402 to K-467; G-403 to K-467; S-404 to K-467; A-405 to K-467; S-406 to K-467; Q- 
407 toK-467; G-408 to K-467: N-409 to K-467; L-410 to K-467; T-41 1 to K-467; E-412 
to K-467; S-413 to K-467; W-414 to K-467; A-415 toK-467; D-416 to K-467; D-417 to 
K-467; N-418 to K-467; P-419 to K-467; R-420 to K-467; H-421 to K-467; H-422 to K- 
467; G-423to K-467; L-424 to K-467; A-425 to K-467; A-426 to K-467; H-427 to K- 
467; S-428 to K-467; S-429 to K-467; G-430 to K-467: E-431to K-467; E-432 to K-467; 
R-433 to K-467; E-434 to K-467; 1-435 to K-467; Q-436 to K-467; Y-437 to K-467; A- 
438 to K-467; P-439to K-467; L-440 to K-467: S-441 to K-467; F-442 to K-467; H-443 
to K-467; K-444 to K-467; G-445 to K-467; E-446 to K-467; P-447to K-467; Q-448 to 
K-467; D-449 to K-467; L-450 to K-467; S-451 to K-467; G-452 to K-467; Q-453 to K- 
467; E-454 to K-467; A-455to K-467; T-456 to K-467; N-457 to K-467; N-458 to K- 
467; E-459 to K-467; Y-460 to K-467; S-461 to K-467; E-462 to K-467; of SEQ ID 
NO:34. Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities may still be retained. For example the 
ability of the shortened CD33-like 3 mutein to induce and/or bind to antibodies which 
recognize the complete or mature forms of the polypeptide generally will be retained 
when less than the majority of the residues of the complete or mature polypeptide are 
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removed from the C-terminus. Whether a particular polypeptide lacking C-terminal 
residues of a complete polypeptide retains such immunologic activities can readily be 
determined by routine methods described herein and otherwise known in the art. It is not 
unlikely that a CD33-like 3 mutein with a large number of deleted C-terminal amino acid 
5 residues may retain some biological or immunogenic activities. In fact, peptides 

composed of as few as six CD33-like 3 amino acid residues may often evoke an immune 
response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 

10 CD33-like 3 polypeptide shown in Figures 16A-B , up to the leucine residue at position 
number 6, and polynucleotides encoding such polypeptides. In particular, the present 
invention provides polypeptides comprising the amino acid sequence of residues 1-mJ of 
Figure 1 , where ml is an integer from 6 to 467 corresponding to the position of the 
amino acid residue in Figures I6A-B . 

15 Moreover, the invention provides polynucleotides encoding polypeptides comprising, or 
alternatively consisting of, the amino acid sequence of C-terminal deletions of the CD33- 
like 3 polypeptide of the invention shown as SEQ ID NO:34 include polypeptides 
comprising the amino acid sequence of residues: M-l to P-466; M-l to 1-465; M-I to K- 
464; M-l to 1-463; M-l to E-462; M-l to S-461; M-l to Y-460; M-l toE-459; M-l to N- 

20 458; M-l to N-457; M-l to T-456; M-l to A-455; M-l to E-454; M-l to Q-453; M-l to 
G-452; M-l to S-451; M-l toL-450; M-l to D-449; M-l to Q-448; M-l to P-447; M-l to 
E-446; M-l to G-445; M-l to K-444; M-l to H-443; M-l to F-442; M-l toS-441; M-l to 
L-440; M-l to P-439; M-l to A-438; M-l to Y-437; M-l to Q-436; M-l to 1-435; M-l to 
E-434; M-l to R-433; M-l toE-432; M-l to E-431 ; M-l to G-430; M-l to S-429; M-l to 

25 S-428; M-l to H-427; M-l to A-426; M-l to A-425; M-l to L-424; M-l toG-423; M-l to 
H-422;M-1 toH-421;M-l toR-420;M-l to P-419; M-l toN-418;M-l toD-417;M-l 
toD-416; M-l to A-415;M-1 toW-414; M-i toS-413;M-l toE-412; M-l toT-411;M-l 
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to L-410; M-l to N-409; Ml to G-408; M-l to Q-407; M-l to S-406; M-l toA-405; M-l 
to S-404; M-l to G-403; M-l to R-402; M-l to 1-401; M-l to T-400; M-l to N-399; M-l 
to A-398; M-l to D-397; M-l toK-396; M-l to M-395; M-l to G-394; M-l to 1-393; M-l 
to D-392; M-l to G-391; M-l to V-390; M-l to D-389; M-l to A-388; M-l toA-387; M- 
5 1 to P-386; M-l to R-385; M-l to A-384; M-l to S-383; M-l to K-382; M-l to K-381; 
M-l to R-380; M-l to C-379; M-l toS-378; M-l to R-377; M-l to V-376; M-l to V-375; 
M-l to 1-374; M-l to F-373; M-l to 1-372; M-l to V-371; M-l to C-370; M-l toF-369; 
M-l to S-368; M-l to L-367; M-l to F-366; M-l to V-365; M-l to L-364; M-l to A-363; 
M-l to T-362; M-l to A-361; M-l toG-360; M-l to A-359; M-l to G-358; M-l to G- 
10 357; M-l to V-356; M-l to A-355; M-l to G-354; M-l to L-353; M-l to L-352; M-l 

toV-351; M-l to G-350; M-l to S-349; M-l to V-348; M-l to P-347; M-l to R-346; M-l 
to M-345; M-l to K-344; M-l to G-343; M-l toT-342; M-l to Y-341; M-l to E-340; M- 
1 to Q-339; M-l to Q-338; M-l to L-337; M-l to S-336; M-l to L-335; M-l to N-334; 
M-l toL-333; M-l to S-332; M-l to V-331; M-l to H-330; M-l to Q-329; M-l to S-328; 
15 M-l to G-327; M-l to L-326; M-l to S-325; M- 1 toN-324; M-l to Q-323; M-l to A-322; 
M-l toR-321;M-l toC-320; M-l toT-319; M-l toF-318;M-l toE-317;M-l toG-316; 
M-l toE-315; M-l to D-314; M-l to G-313; M-l toL-312; M-l to H-31 1; M-l to V-310; 
M-l to Q-309; M-l to L-308; M-l to E-307; M-l toL-306; M-l to V-305; M-l to L-304; 
M-l to P-303; M-l to N-302; M-l to S-301; M-l to P-300; M-l to Q-299; M-l to S-298; 
20 M-l toP-297; M-l to Y-296; M-l to L-295; M-l to T-294; M-l to L-293; M-l to S-292; 
M-l to R-291; M-l to W-290; M-l to T-289; M-l toW-288; M-l to S-287; M-l to L- 
286; M-l to R-285; M-l to A-284; M-l to P-283; M-l to P-282; M-l to N-281; M-l to 
S-280; M-l toD-279; M-l to V-278; M-l to A-277; M-l to C-276; M-l to V-275; M-l to 
L-274; M-l to R-273; M-l to L-272; M-l to S-271; M-l toQ-270; M-l to G-269; M-l to 
25 E-268; M-l to L-267; M-l to V-266; M-l to S-265; M-l to L-264; M-l to S-263; M-l to 
S-262; M-l toS-261; M-l to N-260; M-l to G-259; M-l to L-258; M-l to A-257; M-l to 
T-256; M-l to S-255; M-l to A-254; M-l to T-253; M-l toG-252; M-l to E-251; M-l to 
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G-250; M-l to Q-249; M-l to F-248; M-l to V-247; M-l to T-246; M-l to V-245; M-l 
to T-244; M-l toL-243; M-l to N-242; M-l to Q-241; M-l to P-240; M-l to P-239; M-l 
to Y-238; M-l to S-237; M-l to V-236; M-l to N-235; M-l toL-234; M-l to Q-233; M-l 
to 1-232; M-l to T-231; M-l to R-230; M-l to N-229; M-l to T-228; M-l to T-227; M-l 
5 to V-226; M-l toG-225; M-l to A-224; M-l to G-223; M-l to P-222; M-l to L-221; M-l 
toT-220; M-l toV-219;M-l toQ-218;M-l toC-217; M-l toT-216; M-l toL-215;M-l 
toS-214; M-l toT-213;M-l toG-212;M-l toH-211;M-l toH-210; M-l toQ-209;M- 
1 to P-208; M-l toQ-207; M-l to P-206; M-l to 1-205; M-l to L-204; M-l toT-203; M-l 
to L-202; M-l to V-201; M-l to S-200; M-l to S-199; M-l toR-198; M-l to T-197; M-l 

10 toT-196;M-l toS-195;M-l toP-194;M-l toH-I93;M-l toL-192:M-l toP-191:M-l 
toS-190; M-l toV-189; M-l toS-188;M-l toT-187;M-l toG-186; M-l to M- 185; M-l 
toW-184;M-l toS-183;M-l to 1-182; M-l to M-18UM-1 toP-180; M-l toP-179; M-l 
to T-178; M-l to G-177; M-l to Q-176; M-l to E-175; M-l to C-174; M-l to A-173; M- 
1 to W-172; M-l toP-171; M-l to V-170: M-l to S-169; M-l to C-168; M-l to T-167; 

15 M-l to L-166; M-l to N-165; M-l to Q-164; M-l to F-163; M-l toC-162; M-l to G-161; 
M-l to S-160; M-l to E-159; M-l to L-158; M-l to T-157; M-l to G-156; M-l to P-155; 
M-l to 1-154; M-l toL-153; M-l to 1-152; M-l toN-151;M-l toP-150;M-l toR-149; 
M-l to H-148; M-l toT-147; M-l to L-146; M-l to A-145; M-l toT-144; M-l to V-143: 
M-l to N-142;M-1 toV-141;M-l toS-140;M-l toL-139;M-l toQ-138; M-l toD- 

20 137; M-l to Y-136; M-l toK-135; M-l to Y-134; M-l to N-133; M-I to W-132; M-l to 
K-131;M-1 to 1-130; M-I toN-129;M-l toG-128; M-l toK-127;M-l toE-126;M-l to 
M-125; M-l to R-124; M-l to F-123; M-l to F-122; M-l to Y-121; M-l to R-120; M-l 
to G-l 19; M-l to A-l 18; M-l toD-1 17; M-l to S-l 16; M-l to M-l 15; M-l to R-l 14; M- 
1 to A-l 13; M-l to D-l 12; M-l to R-l 1 1; M-l to 1-1 10; M-l to S-109; M-l toL-108; M- 

25 1 toT-107; M-l to C-106; M-l to N-105; M-l to K-104; M-l toT-103; M-l to Q-102; 
M-l to P-101; M-l to D-100; M-l toG-99; M-l to L-98; M-l to L-97; M-l to H-96; M-l 
to F-95; M-l to R-94; M-l to D-93; M-l to R-92; M-l to T-91; M-l to E-90; M-l toE- 
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89; M-l to Q-88; M-l to V-87; M-l to A-86; M-l to W-85; M-l to A-84; M-l to P-83; 
M-l to N-82; M-l to N-81; M-l to T-80; M-l toA-79; M-l to V-78; M-l to P-77; M-l to 
A-76; M-l to K-75; M-l to W-74; M-l to S-73; M-l to 1-72; M-l to D-71; M-l to N-70; 
M-l toG-69; M-l to A-68; M-l to R-67; M-l to F-66; M-l to W-65; M-l to Y-64; M-l 
to G-63; M-l to H-62; M-l to V-61; M-l to P-60; M-l toD-59; M-l to S-58; M-l to D- 
57; M-l to T-56; M-l to Q-55; M-l to S-54; M-l to D-53; M-l to V-52; M-l to P-51: M- 
1 to Y-50; M-l toS-49; M-l to F-48; M-l to S-47; M-l to C-46; M-l to R-45; M-l to V- 
44; M-l to H-43; M-l to V-42; M-l to C-41; M-l to M-40; M-l toG-39; M-l to E-38: 
M-l to Q-37; M-l to V-36; M-l to T-35; M-l to V-34: M-l to S-33; M-l to S-32; M-l to 
Q-31; M-l to M-30; M-l toT-29; M-l to L-28; M-l to S-27: M-l to Y-26; M-l to D-25; 
M-l toK-24; M-l to R-23; M-l to N-22; M-l toS-21; M-l to K-20: M-l toQ-19; M-l to 
G-18; M-l toE-17; M-l to V-16: M-l to R-15; M-l to E-14; M-l to R-13; M-l to G-12; 
M-l to W-l 1; M-l to L-10; M-l toL-9; M-l to P-8; M-l to L-7; M-l to L-6; of SEQ ID 
NO:34. Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 16 which have been determined 
from the following related cDNA genes: HGBAY02R (SEQ ID NO: 100) and 
HLYBY62R (SEQ ID NO:101). 

Based on the sequence similarity to the human CD33L1, translation product of 
this gene is expected to share at least some biological activities with CD33 proteins, and 
specifically myeloid modulatory proteins and/or siglec proteins. Such activities are 
known in the art, some of which are described elsewhere herein. 

Specifically, polynucleotides and polypeptides of the invention are also useful for 
modulating the differentiation of normal and malignant cells, modulating the 
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the 
immune response. Polynucleotides and polypeptides of the invention may represent a 
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diagnostic marker for hematopoietic and immune diseases and/or disorders. The full- 
length protein should be a secreted protein, based upon homology to the CD33 family. 
Therefore, it is secreted into serum, urine, or feces and thus the levels is assayable from 
patient samples. Assuming specific expression levels are reflective of the presence of 
immune disorders, this protein would provide a convenient diagnostic for early detection. 
In addition, expression of this gene product may also be linked to the progression of 
immune diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of cancer. 

Polynucleotides and polypeptides of the invention may play an important role in 
the pathogenesis of human cancers and cellular transformation, particularly those of the 
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention 
may also be involved in the pathogenesis of developmental abnormalities based upon its 
potential effects on proliferation and differentiation of cells and tissue ceil types. Due to 
the potential proliferating and differentiating activity of said polynucleotides and 
polypeptides, the invention is useful as a therapeutic agent in inducing tissue 
regeneration, for treating inflammatory conditions (e.g., inflammatory bowel syndrome, 
diverticulitis, etc.). Moreover, the invention is useful in modulating the immune 
response to aberrant polypeptides, as may exist in rapidly proliferating cells and tissue 
cell types, particularly in adenocarcinoma cells, and other cancers. 

This gene is expressed predominantly on NK cells, and to a lesser extent on T- 
cells. Therefore, polynucleotides and polypeptides of the invention are useful as reagents 
for differential identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of the following diseases and conditions which include, but are not 
limited to, immune disorders and cancer, as well as the immunodiagnosis of acute 
leukemias. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) or 
cell type(s). For a number of disorders of the above tissues or cells, particularly of the 
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immune system, and breast tissue, expression of this gene at significantly higher or 
lower levels is detected in certain tissues or cell types (e.g. immune, cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 34 as residues: Gly-12 to Tyr-26, VaI-52 to Asp-59, Gln-88 to 
Asp-93, Arg-124 to Asn-129, His-193 to Arg-198, Gln-207 to Thr-213, Gln-338 to Arg- 
346, Ser-378 to AIa-384, Ser-413 to Arg-420, Ser-428 to Glu-434, His-443 to Ser-451, 
Glu-454 to Ser-46L Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in NK cells, in combination with the homology to siglec 
family of proteins indicates the protein product of this gene is useful for the diagnosis 
and treatment of a variety of immune system disorders. NK cells are bone-marrow 
derived granular lymphocytes that play an important role in natural immunity to 
infectious diseases and have the capacity to kill certain virally-infected cells and tumor 
cells that have down-regulated MHC Class-I antigen expression. The killing and 
proinflammatory activities of NK cells are regulated through a variety of cell surface 
receptors that can mediate either activity or inhibitory signals. The best understood 
receptors are those that recognize MHC Class I molecules at the cell surface and deliver 
a negative signal, thereby protecting normal host cells from cytotoxicity. These 
receptors can belong either to the C-type lectin superfamily or the Ig superfamily, 
although in humans the majority are members of the Ig superfamily known as killer cell 
Ig-like receptors (KIRs). Representative uses are described in the "Immune Activity" and 
"infectious disease" sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and 
elsewhere herein. Briefly, the expression indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including blood 
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stem cells. Involvement in the regulation of cytokine production, antigen presentation, 
or other processes indicates a usefulness for treatment of cancer (e.g. by boosting 
immune responses). Expression in cells of lymphoid origin, indicates the natural gene 
product is involved in immune functions. Therefore it would also be useful as an agent 
for immunological disorders including arthritis, asthma, immunodeficiency diseases 
such as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such 
as T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, 
such as host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such 
as autoimmune infertility, lense tissue injury, demyelination, systemic lupus 
erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's Disease, 
and scleroderma. Moreover, the protein may represent a secreted factor that influences 
the differentiation or behavior of other blood cells, or that recruits hematopoietic cells 
to sites of injury. Thus, this gene product is thought to be useful in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the differentiation 
and/or proliferation of various cell types. Based upon the tissue distribution of this 
protein, antagonists directed against this protein is useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
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are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
5 against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 16 and may have been publicly available prior to conception of the present 

10 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1734 of SEQ ID NO: 16, b is an integer of 15 to 

15 1748, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 16, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

20 

This invention relates to newly identified polynucleotides, polypeptides encoded 
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. The polypeptide of the present 
invention has been putatively identified as a human integrin alpha 1 1 homolog derived 
25 from a human osteoblast II cDNA library. More particularly, the polypeptide of the 
present invention has been putatively identified as a human integrin alpha 11-subunit 
homolog, sometimes hereafter referred to as "integrin alpha 11", "integrin alpha 11- 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/2503I 

104 

subunit", "all", "All-subunit", and/or "Integrin all-subunit". The invention also relates 
to inhibiting the action of such polypeptides. 

The integrins are a large family of cell adhesion molecules consisting of noncovalently 
associated ab heterodimers. 

We have cloned and sequenced a novel human integrin a -subunit cDNA, 
designated al 1. The al 1 cDNA encodes a protein with a 22 amino acid signal peptide, a 
large 1 120 residue extracellular domain that contains an I-domain of 207 residues and is 
linked by a transmembrane domain to a short cytoplasmic domain of 24 amino acids. The 
deduced all protein shows the typical structural features of integrin a-subunits and is 
similar to a distinct group of a-subunits from collagen-binding integrins. However, it 
differs from most integrin a-chains by an incompletely preserved cytoplasmic GFFKR 
motif. 

The human ITGA1 1 gene was located to bands q22.3-23 on chromosome 15, and 
its transcripts were found predominantly in bone, cartilage as well as in cardiac and 
skeletal muscle. Expression of the 5.5 kilobase all mRNA was also detectable in ovary 
and small intestine. 

All vertebrate cells express members of the integrin family of cell adhesion molecules, 
which mediate cellular adhesion to other cells and extracellular subtratum, cell migration 
and participate in important physiologic processes from signal transduction to cell 
proliferation and differentiation {Hynes, 92; Springer, 92 }. 

Integrins are structurally homologous heterodimeric type-I membrane 
glycoproteins formed by the noncovalent association of one of eight b -subunits with one 
of the 17 different a-subunits described to date, resulting in at least 22 different ab 
complexes. Their binding specificities for cellular and extracellular ligands are 
determined by both subunits and are dynamically regulated in a cell-type-specific mode 
by the cellular environment as well as by the developmental and activation state of the 
cell {Diamond and Springer, 94}. In integrin a -subunits, the aminoterminal region of the 
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large extracellular domain consists of a seven-fold repeated structure which is predicted 
to fold into a b -propeller domain {Corbi et al., 1987; Springer, 1997 }. The three or four 
C-terminal repeats contain putative divalent cation binding motifs that are thought to be 
important for ligand binding and subunit association {Diamond and Springer, 94}. The 
al, a2, a 10, aD, aE, aL, aM and aX-subunits contain an approximately 200 amino acid I- 
domain inserted between the second and third repeat that is not present in other a-chains 
{Larson et al., 1989}. Several isolated 1-domains have been shown to independently bind 
the ligands of the parent integrin heterodimer {Kamata and Takada, 1994; Randi and 
Hogg, 1994}. The a3, a5-8, allb and aV-subunits are proteolytically processed at a 
conserved site into disulphide-linked heavy and light chains, while the a4-subunit is 
cleaved at a more aminoterminal site into two fragments that remain noncovalently 
associated {Hemler et al., 90}. Additional a-subunit variants are generated by alternative 
splicing of primary transcripts {Ziober et al., 93; Delwel et al., 95; Leung et al., 98}. 

The extracellular domains of a-integrin subunits are connected by a single 
spanning transmembrane domain to short, diverse cytoplasmic domains whose only 
conserved feature is a membrane-proximal KXGFF(K/R)R motif {Sastry and Horwitz, 
1993}. The cytoplasmic domains have been implicated in the cell-type-specific 
modulation of integrin affinity states {Williams et al., 1994}. 

The polypeptide of the present invention has been putatively identified as a 
member of the integrin family and has been termed integrin alpha 1 1 subunit ("al 1"). 
This identification has been made as a result of amino acid sequence homology to the 
human integrin alpha 1 subunit (See Genbank Accession No. gi|346210). 

Figures 19A-F show the nucleotide (SEQ ID NO: 17) and deduced amino acid 
sequence (SEQ ID NO:35) of al 1. Predicted amino acids from about 1 to about 22 
constitute the predicted signal peptide (amino acid residues from about 1 to about 22 in 
SEQ ID NO:35) and are represented by the underlined amino acid regions; amino acids 
from about 666 to about 682, and/or amino acids from about 1 145 to about 1 161 
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constitute the predicted transmembrane domains (amino acids from about 666 to about 
682, and/or amino acids from about 1 145 to about 1 161 in SEQ ID NO:35) and are 
represented by the double underlined amino acids; and amino acids from about 64 to 
about 96 constitute the predicted immunoglobulin and major histocompatibility complex 
protein domain (amino acids from about 64 to about 96 in SEQ ID NO:35) and are 
represented by the bold amino acids. 

Figure 20 shows the regions of similarity between the amino acid 
sequences of the integrin alpha 1 1 subunit (ail) protein (SEQ ID NO:35) and the human 
integrin alpha 1 subunit (SEQ ID NO: 103). 

Figure 21 shows an analysis of the integrin alpha 1 1 subunit (al 1) amino acid 
sequence. Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human ovary ,small intestine, fetal heart, fetal brain, large intestine, osteoblasts, 
human trabecular bone cells, messangial cells, adipocytes, osteosarcoma, 
chondrosarcoma, breast cancer cells, and bone marrow tissues and cells. The 
polynucleotide of this invention was discovered in a human osteoblast II cDNA library. 
Its translation product has homology to the characteristic immunoglobulin and major 
histocompatibility complex protein domain of integrin family members. As shown in 
Figures 19A-F, all has transmembrane domains (the transmembrane domains comprise 
amino acids 666 - 682 and/or 1 145 - 1 161 of SEQ ID NO:35; which correspond to 
amino acids 666 - 682 and/or 1 145 - 1 161 of Figures 19A-F) with strong conservation 
between other members of the integrin family. The polynucleotide contains an open 
reading frame encoding the al 1 polypeptide of 1 189 amino acids. The present invention 
exhibits a high degree of homology at the amino acid level to the human integrin alpha 1 
subunit (as shown in Figure 20). 

Preferred polypeptides of the invention comprise the following amino acid 
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sequence: TNG YQKTGD V YKCP VIHGNCTKLNLGRVTLS N V (SEQ ID NO: 102). 
Polynucleotides encoding these polypeptides are also provided. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the al 1 polypeptide having the amino acid sequence shown in 
Figures I9A-F (SEQ ID NO:35). The nucleotide sequence shown in Figures 19A-F 
(SEQ ID NO:35) was obtained by sequencing a cloned cDNA (HOHBY69), which was 
deposited on November 17 at the American Type Culture Collection, and given 
Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 17 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 17. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 17. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of all polynucleotide fragments of the invention 
include, for example, fragments that comprise, or alternatively, consist of, a sequence 
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to 
about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to 
about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to 
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to 
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about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to 
about 750, from about 751 to about 800, from about 801 to about 850, from about 851 to 
about 900, from about 901 to about 950, from about 951 to about 1000, from about 1001 
to about 1050, from about 105 1 to about 1100, from about 1 101 to about 1 150, from 
5 about 1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 
1300, from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 
to about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from 
about 1551 to about 1600, from about 1601 to about 1650, from about 1651 to about 
1700, from about 1701 to about 1750, from about 1751 to about 1800, from about 1801 

10 to about 1850, from about 1851 to about 1900, from about 1901 to about 1950, from 
about 1951 to about 2000, from about 2001 to about 2050, from about 2051 to about 
2100, from about 2101 to about 2150, from about 2151 to about 2200, from about 2201 
to about 2250, from about 2251 to about 2300, from about 2301 to about 2350, from 
about 2351 to about 2400, from about 2401 to about 2450, from about 2451 to about 

15 2500, from about 2501 to about 2550, from about 255 1 to about 2600, from about 2601 
to about 2650, from about 2651 to about 2700, from about 2701 to about 2750, from 
about 2751 to about 2800, from about 2801 to about 2850, from about 2851 to about 
2900, from about 2901 to about 2950, from about 2951 to about 3000, from about 3001 
to about 3050, from about 3051 to about 3 100, from about 3 101 to about 3 150, from 

20 about 3151 to about 3200, from about 3201 to about 3250, from about 3251 to about 
3300, from about 3301 to about 3350, from about 3351 to about 3400, from about 3401 
to about 3450, from about 345 i to about 3500, from about 3501 to about 3550, from 
about 3551 to about 3600, from about 3601 to about 3650, from about 3651 to about 
3700, from about 3701 to about 3750, from about 3751 to about 3800, from about 3801 

25 to about 3850, from about 3851 to about 3900, from about 3901 to about 3950, from 
about 3951 to about 4000, from about 4001 to about 4050, from about 4051 to about 
4100, from about 4101 to about 4150, from about 4151 to about 4200, from about 4201 
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to about 4250, from about 4251 to about 4300, from about 4301 to about 4350, from 
about 4351 to about 4400, from about 4401 to about 4450, from about 4451 to about 
4500, from about 4501 to about 4550, from about 4551 to about 4600, from about 4601 
to about 4650, from about 4651 to about 4700, from about 4701 to about 4750, from 
about 4751 to about 4800, from about 4801 to about 4850, from about 4851 to about 
4900, from about 4901 to about 4950, from about 4951 to about 4995, from about, from 
about 1 to about 236, from about 144 to about 188, from about 231 to about 276 of SEQ 
ID NO: 17, or the complementary strand thereto, or the cDNA contained in the deposited 
gene. In this context "about" includes the particularly recited ranges, larger or smaller by 
several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 
alternatively, consisting of, any one of the transmembrane domains (amino acid residues 
from about 666 to about 682 and/or 1 145 to about 1 161 in Figures 19A-F (amino acids 
from about 666 to about 682 and/or 1 145 to about 1 161 in SEQ ID NO:35), in addition to 
the immunoglobulin and major histocompatibility complex protein domain (amino acid 
residues from about 64 to about 96 in Figures 19A-F (amino acids from about 64 to 
about 96 in SEQ ID NO:35). Since the location of these domains have been predicted by 
computer analysis, one of ordinary skill would appreciate that the amino acid residues 
constituting these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) 
depending on the criteria used to define each domain. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of al 1 . 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
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regions, surface-forming regions and high antigenic index regions of the present 
invention. 

The data representing the structural or functional attributes of all set forth in 
Figure 21 and/or Table VII, as described above, was generated using the various modules 
and algorithms of the DNA*STAR set on default parameters. In a preferred 
embodiment, the data presented in columns VIII, IX, XIII, and XIV of Table VII can be 
used to determine regions of al 1 which exhibit a high degree of potential for 
antigenicity. Regions of high antigenicity are determined from the data presented in 
columns VIII, IX, XIII, and/or XIV by choosing values which represent regions of the 
polypeptide which are likely to be exposed on the surface of the polypeptide in an 
environment in which antigen recognition may occur in the process of initiation of an 
immune response. 

Certain preferred regions in these regards are set out in Figure 21, but may, as 
shown in Table VII, be represented or identified by using tabular representations of the 
data presented in Figure 21. The DNA*STAR computer algorithm used to generate 
Figure 21 (set on the original default parameters) was used to present the data in Figure 
21 in a tabular format (See Table VII). The tabular format of the data in Figure 21 is 
used to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 21 and in Table VII include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 19A-F. As set out in Figure 21 and in Table VII, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification of loss of one or more biological 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

111 

functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened al 1 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptides generally will be retained when less than the majority of the 
residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that an all mutein 
with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six al 1 
amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the al 1 amino acid sequence shown in 
Figures 19A-F, up to the threonine residue at position number 1 184 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues nl-1189 of Figures 19A-F, where nl is 
an integer from 2 to 1 184 corresponding to the position of the amino acid residue in 
Figures 19A-F (which is identical to the sequence shown as SEQ ID NO:35). In another 
embodiment, N-terminal deletions of the all polypeptide can be described by the general 
formula n2-l 189, where n2 is a number from 2 to 1184, corresponding to the position of 
amino acid identified in Figure 19. N-terminal deletions of the al 1 polypeptide of the 
invention shown as SEQ ID NO:35 include polypeptides comprising the amino acid 
sequence of residues: N-terminal deletions of the al 1 polypeptide of the invention shown 
as SEQ ID NO:35 include polypeptides comprising the amino acid sequence of residues: 
D-2 to E-l 1 89; L-3 to E-l 189; P-4 to E-l 189; R-5 toE-1 189; G-6 to E-1189; L-7 to E- 
1189; V-8 to E-l 189; V-9 to E-1189;A-10 toE-1189; W-ll toE-1189; A-12 toE-1189; 
L-13 to E-l 189;S-14 to E-I189; L-15 to E-l 189; W-16 to E-l 189; P-17 to E-l 189;G-18 
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to E-1189; F-19 to E-l 189; T-20 to E-1189; D-21 to E-l 189;T-22 to E-1189; F-23 to E- 
1 189; N-24 to E-l 189; M-25 to E-l 189;D-26 to E-l 189; T-27 to E-l 189; R-28 to E- 
1 189; K-29 to E-l 189;P-30 to E-l 189; R-3 1 to E-l 1 89; V-32 to E-1189; 1-33 to E- 
1 189;P-34 to E-l 189; G-35 to E-l 189; S-36 to E-l 189; R-37 to E-l 189;T-38 to E-l 189; 
5 A-39 to E- 1 189; F-40 to E- 1 1 89; F-41 to E- 1 1 89;G-42 to E- 1 1 89; Y-43 to E- 1 1 89; T-44 
to E-l 189; V-45 to E-l 189;Q-46 to E-l 189; Q-47 to E-l 189; H-48 to E-l 189; D-49 to E- 
1 189;I-50 to E-l 189; S-5 1 to E-l 189; G-52 to E-l 189; N-53 to E-l 189;K-54 to E-l 189; 
W-55 to E-l 189; L-56 to E-l 1 89; V-57 to E-l 189;V-58 to E-l 189; G-59 to E-l 189; A- 
60 to E-l 189; P-61 to E-l 189;L-62 to E-l 189; E-63 to E-1189; T-64 to E-I 189; N-65 to 

10 E-l 189.G-66 to E-l 189; Y-67 to E-l 189; Q-68 to E-l 189; K-69 to E-l 189;T-70 to E- 
1 189; G-71 to E-l 189; D-72 to E-1189; V-73 to E-l 189;Y-74 to E-l 189; K-75 to E- 
1 189; C-76 to E-1189; P-77 to E-1189;V-78 to E-l 189; 1-79 to E-l 189; H-80 to E-l 189; 
G-81 to E-l 189;N-82 to E-l 189; C-83 to E-1189; T-84 to E-l 189; K-85 to E-l 189;L-86 
to E-l 189; N-87 to E-l 189; L-88 to E-1189; G-89 to E-1189;R-90 to E-1189; V-91 to E- 

15 1 1 89; T-92 to E-l 189; L-93 to E-l 189;S-94 to E-l 189; N-95 to E-l 189; V-96 to E-l 189; 
S-97 to E-1189;E-98 to E-l 189; R-99 to E-l 189; K-100 to E-l 189; D-101 to E-l 189;N- 
102 to E-l 189; M-103 to E-l 189; R-104 to E-1189; L-105 toE-1189; G-106 to E-l 189; 
L-107 to E-l 189; S-108 to E-l 189; L-109to E-l 189; A-110 to E-1189; T-l 1 1 to E-l 189; 
N-l 12 to E-l 189;P-113 to E-l 189; K-l 14 to E-1189; D-l 15 to E-1189; N-l 16 toE-1 189; 

20 S-l 17 to E-l 189; F-l 18 to E-l 189; L-l 19 to E-l 189; A-120to E-l 189; C-121 to E-l 189; 
S-122 to E-l 189; P-123 to E-l 189;L-124 to E-l 189; W-125 to E-1189; S-126 to E-l 189; 
H-127 toE-1 189; E-128 to E-l 189; C-129 to E-l 189; G-130 to E-1189; S-131to E-l 189; 
S-132 to E-1189; Y-133 to E-l 189; Y-134 to E-1189;T-135 to E-1189; T-136 to E-l 189; 
G-137toE-1189; M-138 toE-1189; C-139toE-1189; S-140 to E-1189; R-141 toE- 

25 1 189; V-142to E-l 189; N-143 to E-1189; S-144 to E-l 189; N-145 to E-l 189;F-146 to fi- 
ll 89; R-147 to E-l 189; F-148 to E-l 189; S-149 toE-1189; K-l 50 to E-l 189; T-l 51 to fi- 
ll 89; V- 152 to E- 1 1 89; A- 1 53to E- 1 1 89; P- 1 54 to E- 1 1 89; A- 1 55 to E- 1 1 89; L- 1 56 to 
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E-1189;Q-157 toE-1189; R-158 toE-1189; C-159 to E-1189; Q-160 toE-1189; T-161 to 
E-1189; Y-162 to E-1189; M-163 to E-1189; D-164to E-l 189; 1-165 to E-1189; V-166 to 
E-1189; 1-167 to E-1189; V-168to E-1189; L-169 to E-1189; D-170 to E-l 189; G-171 to 
E-1189.S-172 to E-l 189; N-173 to E-l 189; S-174 to E-1189; 1-175 toE-1189; Y-176 to 
5 E-1189; P-177 to E-l 189; W-178 to E-l 189; V-179to E-1189; E-l 80 to E-l 189; V-181 
to E- 1 1 89; Q- 1 82 to E- 1 1 89;H- 1 83 to E- 1 1 89; F- 1 84 to E- 1 1 89; L- 1 85 to E- 1 1 89; I- 1 86 
toE-1 189; N-187 to E-1189; 1-188 to E-1189; L-189 to E-1189; K-190to E-1189; K-191 
toE-1189;F-192 to E-1189; Y-193 to E-l 189;I-194 to E-1189; G-195 to E-1189; P-196 
to E-1189; G-197 toE-1 189; Q-198 to E-l 189; 1-199 to E-l 189; Q-200 to E-l 189; V- 
10 201 to E-l 189; G-202 to E-l 189; V-203 to E-l 189; V-204 to E-l 189;Q-205 to E-l 189; 
Y-206 to E-l 1 89; G-207 to E-l 189; E-208 toE-1 1 89; D-209 to E-l 189; V-210 to fi- 
lls^ V-21 1 to E-l 189; H-212to E-l 189; E-213 to E-l 189; F-214 to E-1189; H-215 to 
E-1189;L-216 to E-l 189; N-217 to E-l 189; D-218 to E-l 189; Y-219 toE-1189; R-220to 
E-l 189; S-221 to E-1189; V-222 to E-l 189; K-223to E-1189; D-224 to E-l 189; V-225 

15 to E-1189; V-226 to E-l 189;E-227 to E-l 189; A-228 to E-l 189; A-229 to E-l 189; S-230 
toE-1189; H-231 to E-l 189; 1-232 to E-1189; E-233 to E-1189; Q-234to E-l 189; R-235 
to E-l 189; G-236 to E-l 1 89; G-237 to E-l 189;T-238 to E-l 189; E-239 to E-l 189; T-240 
to E-1189; R-241 toE-1 189; T-242 to E-l 189; A-243 to E-l 189; F-244 to E-l 189; G- 
245to E-l 189; 1-246 to E-l 189; E-247 to E-1189; F-248 to E-l 189;A-249 to E-l 189; R- 

20 250 to E-l 189; S-25 1 to E-l 189; E-252 toE-1 189; A-253 to E-l 189; F-254 to E-l 189; Q- 
255 to E-l 189; K-256to E-l 189; G-257 to E-l 189; G-258 to E-l 189; R-259 to E- 
1 189;K-260 to E-l 189; G-261 to E-l 189; A-262 to E-l 189; K-263 toE-1 189; K-264 to 
E-l 189; V-265 to E-l 189; M-266 to E-l 189; I-267to E-l 189; V-268 to E-l 189; 1-269 to 
E-l 189; T-270 to E-l 189;D-271 to E-l 189; G-272 to E-l 189; E-273 to E-l 189; S-274 

25 toE-1189; H-275 to E-l 189; D-276 to E-l 189; S-277 to E-l 189; P-278to E-l 189; D-279 
to E-l 189; L-280 to E-l 189; E-281 to E-l 189;K-282 to E-l 189; V-283 to E-l 189; 1-284 
to E-l 189; Q-285 toE-1 189; Q-286 to E-l 189; S-287 to E-l 189; E-288 to E-l 189; R- 
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289to E-l 189; D-290 to E-l 189; N-291 to E-l 1 89; V-292 to E-l 189;T-293 to E-l 189; 
R-294 to E-l 189; Y-295 to E-l 189; A-296 toE-1 189; V-297 to E-l 189; A-298 to E- 
1 189; V-299 to E-l 189; L-300to E-l 189; G-301 to E-l 189; Y-302 to E-l 189; Y-303 to 
E-l 189;N-304 to E-l 189; R-305 to E-l 189; R-306 to E-l 189; G-307 toE-1 189; 1-308 to 
E-l 189; N-309 to E-l 189; P-310 to E-l 189; E-31 Ito E-l 189; T-312 to E-l 189; F-313 to 
E-l 189; L-314 to E-l 189;N-315 toE-1 189; E-316 to E-1189; 1-317 to E-1189; K-318 
toE-1 189; Y-319 to E-l 189; 1-320 to E-l 189; A-321 to E-l 189; S-322to E-l 189; D-323 
to E-l 189; P-324 to E-l 189; D-325 to E-1189;D-326 to E-l 189; K-327 to E-l 189; H- 
328 to E-1189; F-329 toE-1 189; F-330 to E-l 189: N-331 to E-1189; V-332 to E-l 189; T- 
333to E-l 189; D-334 to E-l 189; E-335 to E-l 189: A-336 to E-l 189;A-337 to E-l 189; L- 
338 to E-l 189; K-339 to E-l 189; D-340 toE-1 189; 1-341 to E-l 189; V-342 to E-l 189; 
D-343 to E-l 189; A-344to E-l 189; L-345 to E-l 189; G-346 to E-l 189; D-347 to E- 
1189;R-348 to E-1189; 1-349 to E-l 189; F-350 to E-l 189; S-351 toE-1189; L-352 to E- 
1189; E-353 to E-1189; G-354 to E-l 189; T-355to E-1189; N-356 to E-l 189; K-357 to 
E-1189; N-358 to E-l 189;E-359 to E-1189; T-360 to E-1189; S-361 to E-1189; F-362 
toE-1 189; G-363 to E-l 189; L-364 to E-l 189; E-365 to E-l 189; M-366to E-l 1 89; S-367 
to E-1189; Q-368 to E-1189; T-369 to E-1189;G-370 to E-l 189; F-371 to E-l 189; S-372 
to E-l 189; S-373 toE-1 189; H-374 to E-1189; V-375 to E-1189; V-376 to E-l 189; E- 
377to E-1189; D-378 to E-l 189; G-379 to E-l 189; V-380 to E-1189;L-381 to E-l 189; L- 
382 to E-l 189; G-383 to E-l 189; A-384 toE-1 189; V-385 to E-l 189; G-386 to E-l 189; 
A-387 to E-l 189; Y-388to E-l 189; D-389 to E-l 189; W-390 to E-l 189; N-391 to E- 
1 189;G-392 to E-l 189; A-393 to E-l 189; V-394 to E-l 189; L-395 toE-1 189; K-396 to 
E-1189; E-397 to E-l 189; T-398 to E-1189; S-399to E-1189; A-400 to E-1189fG-401 to 
E-l 189; K-402 to E-l 189;V-403 to E-l 189; 1-404 to E-l 189; P-405 to E-l 189; L-406 
toE-1 189; R-407 to E-l 189; E-408 to E-l 189; S-409 to E-l 189; Y-410to E-l 189; L-41 1 
to E-1189; K-412 to E-l 189; E-413 to E-1189;F-4I4 to E-1189; P-415 to E-l 189; E-416 
to E-l 189; E-417 toE-1 189; L-418 to E-1189; K-419 to E-l 189; N-420 to E-l 189; H- 
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421to E-l 189; G-422 to E-l 189; A-423 to E-1189; Y-424 to E-l 189;L-425 to E-1189; 
G-426 to E-1189; Y-427 to E-1189; T-428 toE-1189; V-429 to E-1189; T-430 to E-l 189; 
S-431 to E-l 189; V-432to E-l 189; V-433 to E-1189; S-434 to E-l 189; S-435 to E- 
1 189;R-436 to E-l 189; Q-437 to E-l 189; G-438 to E-1 189; R-439 toE-1 189; V-440 to 
E-l 189; Y-441 to E-l 189; V-442 to E-l 189; A-443to E-l 189; G-444 to E-l 189; A-445 
to E-l 189; P-446 to E-l 189;R-447 to E-l 189; F-448 to E-l 189; N-449 to E-l 189; H-450 
toE-1 189; T-45 1 to E-l 189; G-452 to E-l 189; K-453 to E-l 189; V-454to E-l 189; 1-455 
to E-l 189; L-456 to E-l 189; F-457 to E-l 189; T-458to E-l 189; M-459 to E-l 189; H-460 
to E-l 189; N-461 to E-l 189;N-462 to E-l 189: R-463 to E-l 189; S-464 to E-l 189; L-465 
toE-1 189; T-466 to E-l 189; 1-467 to E-l 189; H-468 to E-l 189; Q-469to E-1189; A-470 
to E-l 189; M-471 to E-l 189; R-472 to E-l 189;G-473 to E-l 189; Q-474 to E-l 189; Q- 
475 to E-l 189; 1-476 toE-1 189; G-477 to E-1189; S-478 to E-1189; Y-479 to E-l 189; F- 
480to E-l 189; G-481 to E-l 189; S-482 to E-l 189; E-483 to E-l 189;I-484 to E-l 189; T- 
485 to E-l 189; S-486 to E-1189; V-487 toE-1 189; D-488 to E-l 189; 1-489 to E-l 189; D- 
490 to E- 1 1 89; G-49 1 to E- 1 1 89; D-492 to E- 1 1 89; G-493 to E- 1 1 89; V-494 to E- 
1 189;T-495 to E-l 189; D-496 to E-l 189; V-497 to E-1189; L-498 toE-1 189; L-499 to E- 
1 189; V-500 to E-l 189; G-501 to E-1189; A-502to E-l 189; P-503 to E-1189; M-504 to 
E-1189; Y-505 to E-l 189:F-506 to E-l 189; N-507 to E-l 189; E-508 to E-l 189; G-509 
toE- 1 1 89; R-5 1 0 to E- 1 1 89; E-5 1 1 to E- 1 1 89; R-5 1 2 to E- 1 1 89; G-5 13to E- 1 1 89; K-5 14 
to E-l 189; V-515 to E-l 189; Y-516 to E-1189;V-517 to E-1189; Y-518 to E-1189; E- 
519 to E-l 189; L-520 toE-1 189; R-521 to E-l 189; Q-522 to E-l 189; N-523 to E-l 189; 
R-524to E-l 189; F-525 to E-l 189; V-526 to E-l 189; Y-527 to E-l 189;N-528 to E-l 189; 
G-529 to E-l 189; T-530 to E-l 189; L-531 toE-1 189; K-532 to E-l 189; D-533 to E-l 189; 
S-534 to E-l 189; H-535to E-l 189; S-536 to E-l 189; Y-537 to E-l 189; Q-538 to E- 
1 189;N-539 to E-l 189; A-540 to E-l 189; R-541 to E-l 189; F-542 toE-1 189; G-543 to E- 
1 189; S-544 to E-1189; S-545 to E-1189; I-546to E-l 189; A-547 to E-1189; S-548 to E- 
1 189; V-549 to E-l 189;R-550 to E-l 189; D-551 to E-l 189; L-552 to E-l 189; N-553 
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toE-1 189; Q-554 to E-l 189; D-555 to E-l 189; S-556 to E-l 189; Y-557to E-1189; N-558 
to E-1189; D-559 to E-l 189; V-560 to E-1189;V-561 to E-l 189; V-562 to E-l 189; G- 
563 to E-l 189; A-564 toE-1 189; P-565 to E-l 189; L-566 to E-l 189; E-567 to E-l 189; 
D-568to E-l 189; N-569 to E-l 189; H-570 to E-l 189; A-571 to E-l 189;G-572 to E-l 189; 
A-573 to E-l 1 89; 1-574 to E-l 189; Y-575 toE-1 189; 1-576 to E-l 1 89; F-577 to E-l 189; 
H-578 to E-l 189; G-579to E-1189; F-580 to E-1189; R-581 to E-l 189; G-582 to E- 
1 189;S-583 to E-l 189; 1-584 to E-1189; L-585 to E-l 189; K-586 toE-1 189; T-587 to E- 
1 189; P-588 to E-1189: K-589 to E-l 189; Q-590to E-1189; R-591 to E-l 189; 1-592 to E- 
1 189; T-593 toE-1189;A-594 to E-1189; S-595 to E-1189; E-596 to E-1189; L-597 toE- 
1 189; A-598 to E- 1 189; T-599 to E-l 189; G-600 to E-l 189; L-601to E-l 189; Q-602 to 
E- 1 1 89; Y-603 to E-1189; F-604 to E-l 1 89;G-605 to E- 1 1 89; C-606 to E- 1 1 89; S-607 to 
E-l 189; 1-608 toE-1 189; H-609 to E-l 189; G-610 to E-l 189; Q-61 1 to E-l 189; L-612to 
E-l 189; D-613 to E-l 189; L-614 to E-l 189; N-615 to E-l 189;E-616 to E-1189; D-617 to 
E-l 189; G-618 to E-l 189; L-619 toE-1189; 1-620 to E-l 189; D-621 to E-l 189; L-622 to 
E-l 189; A-623to E-l 189; V-624 to E-l 1 89; G-625 to E-l 189; A-626 to E-l 189;L-627 to 
E-1189; G-628 to E-1189; N-629 to E-l 189; A-630 toE-1189; V-631 to E-l 189; 1-632 to 
E-l 189; L-633 to E-l 189; W-634to E-l 189; S-635 to E-l 189; R-636 to E-l 189; P-637 to 
E-1189; V-638 to E-l 189; V-639 to E-1189; Q-640 to E-1189; 1-641 toE-1 189; N-642 to 
E-l 189; A-643 to E-l 189; S-644 to E-l 189; L-645to E-l 189; H-646 to E-l 189; F-647 to 
E-1189; E-648 toE-1189;P-649 to E-1189; S-650 to E-1189; K-651 to E-1189; 1-652 
toE-1 189; N-653 to E-l 189; 1-654 to E-l 189; F-655 to E-l 189; H-656to E-l 1 89; R-657 
to E-1189; D-658 to E-1189; C-659 to E-l 189;K-660 to E-l 189; R-661 to E-l 189; S-662 
to E- 1 1 89; G-663 toE- 1 1 89; R-664 to E- 1 1 89; D-665 to E-l 1 89; A-666 to E- 1 1 89; T- 
667to E-l 189; C-668 to E-l 189; L-669 to E-l 189; A-670 to E-l 189;A-671 to E-l 189; F- 
672 to E-l 189; L-673 to E-l 189; C-674 toE-1 189; F-675 to E-l 189; T-676 to E-l 189; P- 
677 to E-1189; 1-678 toE-1189; F-679 to E-1189; L-680 to E-1189; A-681 to E-1189; P- 
682to E-l 189; H-683 to E-l 189; F-684 to E-l 189; Q-685 to E-l 189;T-686 to E-l 189; T- 
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687 to E-1189; T-688 to E-1189; V-689 toE-1189; G-690 to E-l 189; 1-691 to E-l 189; R- 
692 to E-l 189; Y-693to E-l 189; N-694 to E-1189; A-695 to E-1189; T-696 to E- 
1 189;M-697 to E-l 189; D-698 to E-1189; E-699 to E-l 189; R-700 toE-1189; R-701 to 
E-l 189; Y-702 to E-l 189; T-703 to E-l 189; P-704to E-l 189; R-705 to E-l 189; A-706 to 
E-l 189; H-707 to E-l 189;L-708 to E-l 189; D-709 to E-l 189; E-710 to E-l 189; G-71 1 
toE-1 189; G-712 to E-l 189; D-713 to E-l 189; R-714 to E-l 189; F-715to E-l 189; T-716 
to E-1189; N-717 to E-l 189; R-718 to E-1189;A-719 to E-1189; V-720 to E-1189; L- 
72 1 to E- 1 1 89; L-722 toE- 1189; S-723 to E- 1 1 89; S-724 to E- 1 1 89; G-725 to E-l 1 89; Q- 
726to E-l 189; E-727 to E-l 189; L-728 to E-l 189; C-729 to E-l 189;E-730 to E-l 189; R- 
731 to E-l J 89; 1-732 to E-l 189; N-733 toE-1 189; F-734 to E-l 189; H-735 to E-l 189; V- 
736 to E-l 189; L-737to E-l 189; D-738 to E-l 189; T-739 to E-l 189; A-740 to E-l 189;D- 
741 to E-l 189; Y-742 to E-l 189; V-743 to E-l 189; K-744 toE-1 189; P-745 to E-1189; 
V-746 to E-l 189; T-747 to E-l 189; F-748to E-l 189; S-749 to E-l 189; V-750 to E-l 189; 
E-751 to E-l 189;Y-752 to E-l 189; S-753 to E-l 189; L-754 to E-l 189; E-755 toE-1 189; 
D-756 to E-l 189; P-757 to E-l 189; D-758 to E-l 189; H-759to E-l 189; G-760 to E- 
1189; P-761 to E-1189; M-762 to E-l 189;L-763 to E-l 189; D-764 to E-l 189; D-765 to 
E-l 189; G-766 toE-1189; W-767 to E-1189; P-768 to E-l 189; T-769 to E-1189; T-770to 
E-l 189; L-771 to E-l 189: R-772 to E-l 189; V-773 to E-l 189;S-774 to E-l 189; V-775 to 
E-1189; P-776 to E-1189; F-777 toE-1189; W-778 to E-l 189; N-779 to E-1189; G-780 
to E-l 189; C-781to E-l 189; N-782 to E-1189; E-783 to E-l 189; D-784 to E-l 189;E-785 
to E-l 189; H-786 to E-l 189; C-787 to E-l 189; V-788 toE-1189; P-789 to E-l 189; D-790 
to E-l 189; L-791 to E-l 189; V-792to E-l 189; L-793 to E-l 189; D-794 to E-l 189; A-795 
to E-l I89;R-796 to E-l 189; S-797 to E-l 189; D-798 to E-l 189; L-799 toE-1189; P-800 
to E-l 189; T-801 to E-l 189; A-802 to E-l 189; M-803to E-l 189; E-804 to E-l 189; Y- 
805 to E-l 189; C-806 to E-l 189;Q-807 to E-1189; R-808 to E-l 189; V-809 to E-l 189; 
L-810 toE-1 189; R-81 1 to E-l 189; K-812 to E-1189; P-813 to E-l 189; A-814to E-1189; 
Q-815 to E-1 189; D-816 to E-l 189; C-817 to E-l 189;S-818 to E-1189; A-819 to E-l 189; 
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Y-820 to E-1189; T-821 toE-1189; L-822 to E-1189; S-823 to E-l 189; F-824 to E-1189; 
D-825to E-l 189; T-826 to E-l 189; T-827 to E-l 189; V-828 to E-l 189;F-829 to E-l 189; 
1-830 to E-l 189; 1-831 to E-l 189; E-832 to E-l 189;S-833 to E-l 189; T-834 to E-l 189; 
R-835 to E-1189; Q-836 toE-1189; R-837 to E-1189; V-838 to E-1189; A-839 to E- 
5 1189; V-840to E-l 189; E-841 to E-l 189: A-842 to E-l 189; T-843 to E-l 189;L-844 to E- 
1 189; E-845 to E-l 189; N-846 to E-l 189; R-847 toE-1 189; G-848 to E-J 189; E-849 to 
E-l 189; N-850 to E-l 189; A-851to E-l 189; Y-852 to E-1189; S-853 to E-1189; T-854 to 
E-l 189;V-855 to E-l 189; L-856 to E-1189; N-857 to E-l 189; 1-858 toE-1 189; S-859 to 
E-1189; Q-860 to E-1189; S-861 to E-1189; A-862toE-1189; N-863 to E-1189; L-864 to 
10 E-l 189; Q-865 to E-l 189;F-866 to E-l 189; A-867 to E-l 1 89; S-868 to E-l 1 89; L-869 

toE- 1 1 89; 1-870 to E- 1 1 89; Q-87 1 to E- 1 1 89; K-872 to E- 1 1 89; E-87310 E- 1 1 89; D-874 
to E-l 189; S-875 to E-l 189; D-876 to E-l 189;G-877 to E-l 189; S-878 to E-l 189; 1-879 
to E-l 189; E-880 toE-1 189; C-881 to E-l 189; V-882 to E-1189; N-883 to E-l 189; E- 
884to E-l 189; E-885 to E-1189; R-886 to E-l 189: R-887 to E-l 189;L-888 to E-l 189; Q- 

15 889 to E-l 189; K-890 to E-l 189; Q-891 toE-1 189: V-892 to E-l 189; C-893 to E-1189; 
N-894 to E-l 189; V-895to E-l 189; S-896 to E-l 189; Y-897 to E-l 189; P-898 to E- 
1189;F-899 to E-1189; F-900 to E-1189;R-901 to E-1189; A-902toE-1189; K-903toE- 
1189; A-904 to E-l 189; K-905 to E-1189; V-906to E-l 189; A-907 to E-1189; F-908 to 
E-l 189; R-909 to E-l 189;L-910 to E-l 189; D-91 1 to E-l 189; F-912 to E-l 189; E-913 

20 toE-1189; F-914 to E-1189; S-915 to E-1189; K-916 to E-l 189; S-917to E-l 189; 1-918 
to E-l 189; F-919 to E-l 189; L-920 to E-l 189;H-921 to E-l 189; H-922 to E-l 189; L-923 
to E-1189; E-924 toE-1189; 1-925 to E-1189; E-926 to E-1189; L-927 to E-l 189; A- 
928to E-l 189; A-929 to E-l 189; G-930 to E-l 189; S-931 to E-l 189;D-932 to E-l 189; S- 
933 to E-l 189; N-934 to E-l 189; E-935 toE-1 189; R-936 to E-l 189; D-937 to E-l 189; 

25 S-938 to E-l 189; T-939to E-l 189; K-940 to E-l 189; E-941 to E-l 189; D-942 to E- 

1 189;N-943 to E-l 189; V-944 to E-l 1 89; A-945 to E-l 189; P-946 toE-1 189; L-947 to fi- 
ll 89; R-948 to E-l 189; F-949 to E-l 189; H-950to E-l 189; L-951 to E-l 189; K-952 to 
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E-1189; Y-953 to E-ll89;E-954 to E-1189; A-955 to E-l 189; D-956 to E-1189; V-957 
toE-1 189; L-958 to E-l 189; F-959 to E-1189; T-960 to E-l 189; R-961to E-1189; S-962 
to E-l 189; S-963 to E-l 189; S-964 to E-1189;L-965 to E-l 189; S-966 to E-1189; H-967 
to E- 1 1 89; Y-968 toE-1 1 89; E-969 to E-l 1 89; V-970 to E-l 1 89; K-97 1 to E-l 189; L- 
972to E-l 1 89; N-973 to E-l 189; S-974 to E-l 189; S-975 to E-l 189;L-976 to E-l 189; E- 
977 to E-l 189; R-978 to E-l 189; Y-979 toE-1 189; D-980 to E-l 189; G-981 to E-l 189; I- 
982 to E-l 189; G-983to E-1189; P-984 to E-l 189; P-985 to E-1189; F-986 to E-l 189;S- 
987 to E-l 189; C-988 to E-1189; 1-989 to E-l 189; F-990 toE-1189; R-991 to E-l 189; 1- 
992 to E-l 189; Q-993 to E-l 189; N-994to E-l 189; L-995 to E-l 189; G-996 to E-l 189; 
L-997 to E-1189;F-998 to E-l 189; P-999 to E-l 189; 1-1000 to E-l 189; H-1001 toE- 
1 189: G-1002 to E-l 189; 1-1003 to E-l 189; M-1004 to E-l I89;M-1005 to E-l 189; K- 
1006 to E-l 189; 1-1007 to E-l 189; T-1008 toE-1 189; 1-1009 to E-l 189; P-1010 to E- 
1 189; 1-1011 to E-l 189;A-1012 to E-l 189; T-1013 to E-l 189; R-1014 to E-1189; S-1015 
toE-1189; G-1016 to E-l 189; N-1017 to E-l 189; R-1018 to E-l 189;L-1019 to E-1189; 
L-1020 to E-1189; K-1021 to E-1189; L-1022 toE-1189; R-1023 to E-l 189; D-1024 to 
E-l 189; F-1025 to E-1189;L-1026 to E-l 189; T-1027 to E-l 189; D-1028 to E-l 189; E- 
1029 toE-1 189; V-1030toE-1189; A-1031 to E-1189; N-1032 to E-1189;T-1033 to E- 
1 189; S-1034 to E-1189; C-1035 to E-1189; N-1036 toE-1 189; 1-1037 to E-l 189; W- 
1038 to E-l 189; G-1039 to E-l 189;N-1040 to E-l 189; S-1041 to E-l 189; T-1042 to E- 
1 189; E-1043 toE-1 189; Y-1044 to E-l 189; R-1045 to E-l 189; P-1046 to E-l 189;T- 
1047 to E-l 189; P-1048 to E-l 189; V-1049 to E-l 189; E-1050 toE-1189; E-1051 to E- 
1 189; D-1052 to E-l 189; L-1053 to E-l 189;R-1054 to E-l 189; R-1055 to E-1189; A- 
1056 to E-l 189; P-1057 toE-1 189; Q-1058 to E-l 189; L-1059 to E-l 189; N-1060 to E- 
1189;H-1061 to E-1189; S-1062 to E-1189; N-1063 to E-l 189; S-1064 toE-1 189; D- 
1065 to E-l 189; V-1066 to E-l 189; V-1067 to E-1189;S-1068 to E-l 189; 1-1069 to fi- 
lls* N-1070 to E-1189; C-1071 toE-1189; N-1072 to E-1189; 1-1073 to E-1189; R- 
1074 to E-l 189;L-1075 to E-l 189; V-1076 to E-l 189; P-1077 to E-l 189; N-1078 toE- 
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1189; Q-1079 to E-l 189; E-1080 to E-l 189; 1-1081 to E-l 189;N-1082 to E-1189; F- 
1083 to E-l 189; H-1084 to E-1189; L-1085 toE-1189; L-1086 to E-l 189; G-1087 to E- 
1189; N-1088 to E-1189;L-1089 to E-1189; W-1090 to E-l 189; L-1091 to E-1189; R- 
1092 toE-1 189; S-1093 to E-l 189; L-1094 to E-l 189; K-1095 to E-l 189;A-1096 to E- 
1189: L-1097 to E-l 189; K-1098 to E-l 189; Y-1099 toE-1189; K-l 100 to E-1189: S- 
1 101 to E-l 189; M-l 102 to E-l 189;K-1 103 to E-l 1 89; 1-1 104 to E- 1 189; M-l 105 to fi- 
ll 89; V-l 106 toE-l 189; N-l 107 to E-l 189; A- 1 108 to E-l 189; A-1109 to E-l 189;L- 
1110 to E-l 189; Q-llll to E-1189; R-l 112 to E-l 189; Q-1113 toE-1 189; F-1114 to E- 
1189; H-l 115 to E-l 189; S-1116 to E- 1 1 89:P-1 1 17 to E-1189; F-ll 18 to E-1189; 1-1 1 19 
to E-l 189: F-l 120 toE-1 189; R-l 121 to E-l 189; E-l 122 to E-l 189; E-l 123 to E-1189;D- 
1124 to E-1189; P-1125 to E-1189; S-1126 to E-1189; R-l 127 toE-1189; Q-1128 to E- 
1 189; I- 1 129 to E-l 189; V-l 130 to E-l 1 89;F-1 131 to E-l 189; E-l 132 to E-l 189; 1-1 133 
to E-1189; S-l 134 toE-1 189; K-l 135 to E-l 189; Q- 11 36 to E-1189; E-l 137 to E- 
1 189;D-1 138 to E-l 189; W-l 139 to E-l 189; Q-l 140 to E-l 189; V-l 141 toE-1 189; P- 
1142 to E-l 189: 1-1 143 to E-l 189; W-l 144 to E-l 189;I-1 145 to E-l 189; 1-1 146 to E- 
1189; V-l 147 to E-1189; G-1148 toE-1189; S-l 149 to E-l 189; T-l 150 to E-l 189; L- 
1151toE-1189;G-1152toE-1189; G-1153 to E-1189; L-1154 to E-1189; L-1155 toE- 
1 189; L-l 156 to E-l 189; L-1157 to E-l 189; A-l 158 to E-l 189;L-1 159 to E-l 189; L- 
1160 to E-1189; V-l 161 to E-1189; L-l 162 toE-1 189; A-l 163 to E-1189; L-l 164 toE- 
1 189; W-l 165 to E-l 189;K-1 166 to E-l 189; L-1167 to E-l 189; G-l 168 to E-1189; F- 
1169toE-1189; F-l 170 to E-l 189; R-l 171 to E-1189; S-l 172 to E-1189;A-1173 to fi- 
ll 89; R-l 174 to E-l 189; R-l 175 to E-l 189; R-l 176 toE-1189; R-l 177 to E-l 189; E- 
1 178 to E-1189; P-1179 to E-l 189;G-1 180 to E-1189; L-l 181 to E-l 189; D-l 182 to E- 
1 189; P-l 183 toE-1 189; T-l 184 to E-l 189; of SEQ ID NO:35. Polypeptides encoded by 
these polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
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functions of the protein, other functional activities (e.g., biological activities (e.g., ability 
to illicit mitogenic activity, induce differentiation of normal or malignant cells, ability to 
multimerize, etc.) may still be retained. For example the ability of the shortened al 1 
mutein to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that an al 1 mutein with a large number 
of deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six al 1 amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the al 1 
polypeptide shown in Figures 19A-F, up to the glycine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-ml of Figures 
19A-F, where ml is an integer from 6 to 1 189 corresponding to the position of the amino 
acid residue in Figures 19A-F. Moreover, the invention provides polynucleotides 
encoding polypeptides comprising, or alternatively consisting of, the amino acid 
sequence of C-terminal deletions of the al 1 polypeptide of the invention shown as SEQ 
ID NO:35 include polypeptides comprising the amino acid sequence of residues: M-l to 
L-1188;M-1 toV-1187; M-l toK-1186;M-l toP-U85;M-l toT-1184;M-l toP-1183; 
M-l toD-1182; M-l to L-l 181; M-l to G-1180; M-l toP-1179; M-l toE-1178; M-l to 
R-l 177; M-l to R-l 176; M-l to R-l 175; M-l to R-1174; M-l to A-1173; M-lto S-l 172; 
M-l toR-1171;M-l toF-1170; M-l toF-1169;M-l toG-1168;M-l to L-l 167; M-l to 
K-1166;M-1 to W-1165; M-l to L-l 164; M-l to A-1163; M-l to L-l 162; M-l to V- 
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1 161; M-l to L-l 160; M-l toL-1 159; M-l to A-l 158; M-l to L-l 157; M-l to L-1156; 
M-l to L-l 155; M-l to L-l 154; M-l to G-l 153; M-lto G-l 152; M-l to L-l 151; M-l to 
T-1150;M-1 toS-1149;M-l toG-1148;M-l toV-1147;M-l toI-1146;M-l to 1-1145; 
M-l to W-1144;M-1 to 1-1 143; M-l toP-1142;M-l to V-1141;M-1 toQ-1140;M-l 
toW-1 139; M-l to D-l 138; M-l to E-l 137; M-l to Q-l 136; M-l to K-l 135; M-l to S- 
1134; M-l to 1-1133; M-lto E-l 132; M-l to F-l 131; M-l to V-l 130; M-l to 1-1129; M- 
1 to Q-l 128; M-l to R-l 127; M-l to S-l 126;M-1 to P-l 125; M-l to D-l 124; M-l to E- 
1 123; M-l to E-l 122; M-l to R-l 121; M-l to F-l 120; M-l tol-l 119; M-l to F-l 118; M- 
1 to P-l 1 17; M-l to S-l 1 16; M-l to H-l 1 15; M-l to F-l 1 14; M- 1 to Q-l 1 13; M-lto R- 
1 1 12; M-l to Q-l 1 1 1 ; M-l to L-l 1 10: M-l to A-l 109; M-l to A-l 108; M-l to N-l 107; 
M-l to V-1106;M-I to M-l 105; M-l to 1-1104; M-l to K-l 103; M-l to M-l 102; M-l to 
S-l 101 ; M-l to K-1100: M-l toY-1099; M-l to K-1098; M-l to L-1097; M-l to A-1096; 
M-l toK-1095;M-l toL-1094; M-l to S-1093; M-lto R-1092; M-l toL-1091;M-l to 
W-1090; M-I to L-1089; M-l to N-1088: M-l to G-1087; M-l to L-1086;M-1 to L- 
1085; M-l to H-1084; M-l to F-1083; M-l to N-1082; M-l to 1-1081; M-l to E-1080; 
M-l toQ-1079; M-l to N-1078; M-l to P-1077; M-l to V-1076; M-l to L-1075; M-l to 
R-1074;M-1 to 1-1073; M-lto N-1072; M-l toC-1071;M-l toN-1070; M-l to 1-1069; 
M-l to S-1068; M-l to V-1067; M-l to V-1066;M-1 to D-1065; M-l to S-1064; M-l to 
N-1063;M-ltoS-1062;M-l toH-1061;M-l toN-1060; M-l toL-1059;M-l toQ-1058; 
M-l toP-1057;M-l toA-1056; M-l toR-1055;M-l toR-1054; M-l to L-1053; M-lto 
D-1052; M-l to E-1051; M-l to E-1050; M-l to V-1049; M-l to P-1048; M-l toT-1047; 
M-l to P-1046;M-1 to R-1045; M-l to Y-1044; M-l to E-1043; M-l to T-1042; M-l to 
S-1041; M-l to N-1040; M-l toG-1039; M-l to W-1038; M-l to 1-1037; M-l to N-1036; 
M-l to C-1035; M-l to S-1034; M-l to T-1033; M-lto N-1032; M-l to A-1031; M-l to 
V-1030; M-l to E-1029; M-l to D-1028; M-l to T-1027; M-l to L-1026;M-1 to F-1025; 
M-l toD-1024; M-l toR-1023; M-l toL-1022; M-l toK-1021;M-l toL-1020; M-l 
toL-1019; M-l to R-1018; M-l to N-1017; M-l to G-1016; M-l to S-1015; M-l to R- 
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1014; M-l toT-1013; M-ho A-1012; M-l to 1-1011; M-l to P-1010; M-l to 1-1009; M- 
1 toT-1008; M-l to 1-1007; M-l to K-1006; M-lto M-1005; M-l to M-1004; M-l to I- 
1003; M-l to G-1002; M-l to H-1001; M-l to 1-1000; M-l to P-999;M-1 to F-998; M-l 
to L-997; M-l to G-996; M-l to L-995; M-l to N-994; M-l to Q-993; M-l to 1-992; M-l 
5 toR-991; M-l to F-990; M-l to 1-989; M-l to C-988; M-l to S-987; M-l to F-986; M-l 
to P-985; M-l to P-984;M-1 to G-983; M-l to 1-982; M-l to G-981; M-l to D-980; M-l 
to Y-979; M-l to R-978; M-l to E-977; M-l toL-976; M-l to S-975; M-l to S-974; M-l 
to N-973; M-l to L-972; M-l to K-971; M-l to V-970; M-l toE-969; M-l to Y-968; M-l 
to H-967; M-l to S-966; M-l to L-965; M-l to S-964; M-l to S-963; M-l to S-962;M-1 

10 to R-961; M-l toT-960; M-l to F-959; M-l toL-958: M-l to V-957; M-l to D-956; M-l 
to A-955; M-l toE-954; M-l to Y-953; M-l to K-952; M-l to L-951; M-l to H-950; M-l 
to F-949; M-l to R-948; M-l toL-947; M-l to P-946; M-l to A-945; M-l to V-944; M-l 
to N-943; M-l to D-942; M-l to E-941; M-l toK-940; M-l toT-939; M-l to S-938; M-l 
to D-937; M-l to R-936; M-l to E-935; M-l to N-934; M-l toS-933; M-l to D-932; M-l 

15 to S-931; M-l to G-930; M-l to A-929; M-l to A-928; M-l to L-927; M-l toE-926; M-l 
to 1-925; M-l to E-924; M-l to L-923; M-l to H-922; M-l to H-921; M-l to L-920; M-l 
to F-919;M-1 to 1-918; M-l to S-917; M-l to K-916; M-l to S-915; M-l to F-914; M-l 
toE-913;M-l toF-912;M-l toD-911;M-l toL-910; M-l toR-909;M-l toF-908;M-l 
to A-907; M-l to V-906; M-l to K-905; M-l toA-904; M-l to K-903; M-l to A-902; M- 

20 1 to R-901; M-l to F-900; M- 1 to F-899; M-l to P-898; M-l toY-897; M-l to S-896; M- 
1 to V-895; M-l to N-894; M-l to C-893; M-l to V-892; M-l to Q-891; M-l toK-890; 
M-l to Q-889; M-l to L-888; M-l to R-887; M-l to R-886; M-l to E-885; M-l to E-884; 
M-l toN-883; M-l to V-882; M-l to C-881; M-l to E-880; M-l to 1-879; M-l to S-878; 
M-l to G-877; M-l to D-876;M-1 to S-875; M-l to D-874; M-l to E-873; M-l to K-872; 

25 M-l to Q-871; M-l to 1-870; M-l to L-869; M-l toS-868; M-l to A-867; M-l to F-866; 
M-l to Q-865; M-l to L-864; M-l to N-863; M-l to A-862; M-l toS-861; M-l to Q-860; 
M-l to S-859; M-l to 1-858; M-l to N-857; M-l to L-856; M-l to V-855; M-l to T- 
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854;M-1 to S-853; M-l to Y-852; M-l to A-851; M-l to N-850; M-l to E-849; M-l to 
G-848; M-l to R-847; M-lto N-846; M-l to E-845; M-l to L-844; M-l to T-843; M-l to 
A-842; M-l to E-841; M-l to V-840; M-l toA-839; M-l to V-838; M-l to R-837; M-l to 
Q-836; M-l to R-835; M-l to T-834; M-l to S-833; M-l toE-832; M-l to 1-831; M-l to 
5 1-830; M-l to F-829; M-l to V-828; M-l to T-827; M-l to T-826; M-l to D-825;M-1 to 
F-824;M-1 toS-823;M-l toL-822;M-l toT-821;M-l to Y-820; M-l to A-819; M-l to 
S-818; M-l toC-817; M-l to D-816; M-l to Q-815; M-l to A-814; M-l to P-813; M-l to 
K-812; M-l to R-811; M-l toL-810; M-l to V-809; M-l to R-808; M-l to Q-807; M-l to 
C-806; M-l to Y-805; M-l to E-804; M-l toM-803; M-l to A-802; M-l to T-801; M-l 
10 to P-800; M-l to L-799; M-l to D-798; M-l to S-797; M-l toR-796: M-l to A-795; M-l 
to D-794; M-l to L-793; M-l to V-792; M-l to L-791; M-I to D-790; M-l toP-789; M-l 
to V-788; M-l to C-787; M-l to H-786; M-l to E-785; M-l to D-784; M-l to E-783; M- 
1 toN-782; M-l to C-781; M-l to G-780; M-l to N-779; M-l to W-778; M-l to F-777; 
M-l to P-776; M-l toV-775; M-l to S-774; M-l to V-773; M-l to R-772; M-l to L-771; 
15 M-l to T-770; M-l to T-769; M-l to P-768;M-1 to W-767; M-l to G-766; M-l to D-765; 
M-l to D-764; M-l to L-763; M-l to M-762; M-l to P-761; M-lto G-760; M-l to H- 
759; M-l to D-758; M-l to P-757; M-l to D-756; M-l to E-755: M-l to L-754; M-l toS- 
753; M-l to Y-752; M-l to E-751; M-l to V-750; M-l to S-749; M-l to F-748; M-l to 
T-747; M-l to V-746;M-1 to P-745; M-l to K-744; M-l to V-743; M-l to Y-742; M-l to 
20 D-741; M-l to A-740; M-l to T-739; M-lto D-738; M-l to L-737; M-l to V-736; M-l to 
H-735; M-l to F-734; M-l to N-733; M-l to 1-732; M-l toR-731; M-l to E-730; M-l to 
C-729; M-l to L-728; M-l to E-727; M-l to Q-726; M-l to G-725; M-l toS-724; M-l to 
S-723; M-l to L-722; M-l to L-721; M-l to V-720; M-l to A-719; M-l to R-718; M-l 
toN-717; M-l to T-716; M-l to F-715; M-l to R-714; M-l to D-713; M-l to G-712; M-l 
25 to G-711; M-l toE-710; M-l to D-709; M-l to L-708; M-l to H-707; M-l to A-706; M-l 
to R-705; M-l to P-704; M-l toT-703; M-l to Y-702; M-l to R-701; M-l to R-700; M-l 
to E-699; M-l to D-698; M-l to M-697; M-l toT-696; M-l to A-695; M-l to N-694; M- 
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1 to Y-693; M-l to R-692; M-l to 1-691; M-l to G-690; M-l toV-689; M-l to T-688; M- 

1 to T-687; M-l to T-686; M-l to Q-685; M-l to F-684; M-l to H-683; M-l to P-682;M- 

1 to A-681; M-l to L-680; M-l to F-679; M-l to 1-678; M-l to P-677; M-l to T-676; M- 

1 to F-675; M-l toC-674; M-l to L-673; M-l to F-672; M-l to A-671; M-l to A-670; M- 

1 to L-669; M-l to C-668; M-l to T-667;M-1 to A-666; M-l to D-665; M-l to R-664; 

M-l to G-663; M-l to S-662; M-l to R-661; M-l to K-660; M-lto C-659; M-l to D-658; 

M-l to R-657; M-l to H-656; M-l to F-655; M-l to 1-654; M-l to N-653; M-l toI-652; 

M-l to K-651; M-l to S-650; M-l to P-649; M-l to E-648; M-l to F-647; M-l to H-646; 

M-l to L-645;M-1 to S-644; M-l to A-643; M-l to N-642; M-l to 1-641; M-l to Q-640; 

M-l to V-639; M-l to V-638; M-l toP-637; M-l to R-636; M-l to S-635; M-l to W-634; 

M-l to L-633; M-l to 1-632; M-l to V-631; M-l to A-630;M-1 to N-629; M-l to G-628; 

M-l to L-627; M-l to A-626; M-l to G-625; M-l to V-624; M-l to A-623; M-lto L-622; 

M-l to D-621; M-l to 1-620; M-l to L-619; M-l to G-618; M-l to D-617; M-l to E-616; 

M-l toN-615;M-l toL-614; M-l toD-613;M-l toL-612;M-l toQ-611;M-l toG-610; 

M-l to H-609; M-l tol-608; M-l to S-607; M-l to C-606; M-l to G-605; M-l to F-604; 

M-l to Y-603; M-l to Q-602; M-l to L-601;M-1 to G-600; M-l to T-599; M-l to A-598; 

M-l to L-597; M-l to E-596; M-l to S-595; M-l to A-594; M-l toT-593; M-l to 1-592; 
M-l to R-591; M-l to Q-590; M-l to K-589; M-l to P-588; M-l to T-587; M-l to K- 
586;M-1 to L-585; M-l to 1-584; M-l to S-583; M-l to G-582; M-l to R-581; M-l to F- 
580; M-l to G-579; M-l toH-578; M-l to F-577; M-l to 1-576; M-l to Y-575; M-l to I- 
574; M-l to A-573; M-l to G-572; M-l to A-571;M-1 to H-570; M-l to N-569; M-l to 
D-568; M-l to E-567; M-l to L-566; M-l to P-565; M-l to A-564; M-lto G-563; M-l to 
V-562; M-l to V-561; M-l to V-560; M-l to D-559; M-l to N-558; M-l to Y-557; M-l 
toS-556; M-l to D-555; M-l to Q-554; M-l to N-553; M-l to L-552; M-l to D-551; M-l 
to R-550; M-l toV-549; M-l to S-548; M-l to A-547; M-l to 1-546; M-l to S-545; M-l 
to S-544; M-l to G-543; M-l to F-542;M-1 to R-541; M-l to A-540; M-l to N-539; M-l 
to Q-538; M-l to Y-537; M-l to S-536; M-l to H-535; M-lto S-534; M-l to D-533; M-l 
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to K-532; M-l to L-531; M-l to T-530; M-l to G-529; M-l to N-528; M-l toY-527; M-l 
to V-526; M-l to F-525; M-l to R-524; M-l to N-523; M-l to Q-522; M-l to R-521; M- 
1 toL-520; M-l toE-519; M-l to Y-518; M-l to V-517; M-l to Y-516; M-l to V-515; 
M-l to K-514; M-l toG-513; M-l to R-512; M-l to E-51 1; M-l to R-510; M-l to G-509; 
M-l to E-508; M-l to N-507; M-l toF-506; M-l to Y-505; M-l to M-504; M-l to P-503; 
M-l to A-502; M-l to G-501; M-l to V-500; M-l toL-499; M-l to L-498; M-l to V-497; 
M-l to D-496; M-l to T-495; M-l to V-494; M-l to G-493: M-l toD-492; M-l to G- 
491; M-l to D-490; M-l to 1-489; M-l to D-488; M-l to V-487; M-l to S-486; M-l to T- 
485;M-1 to 1-484; M-l to E-483; M-l to S-482; M-l to G-481; M-l to F-480; M-l to Y- 
479; M-l to S-478; M-l toG-477; M-l to 1-476; M-l to Q-475; M-l to Q-474; M-l to G- 
473; M-l to R-472; M-l to M-471; M-l toA-470; M-l to Q-469; M-l to H-468; M-l to 
1-467; M-l to T-466; M-l to L-465; M-l to S-464; M-l to R-463;M-l to N-462; M-l to 
N-461; M-l to H-460; M-l to M-459; M-l to T-458; M-l to F-457; M-l to L-456; M-lto 
1-455; M-l to V-454; M-l to K-453; M-l to G-452; M-l toT-451; M-l to H-450; M-l to 
N-449; M-l toF-448; M-l to R-447; M-l to P-446; M-l to A-445; M-l to G-444; M-l to 
A-443; M-l to V-442; M-l toY-441; M-l to V-440; M-l to R-439; M-l to G-438; M-l 
to Q-437; M-l to R-436; M-l to S-435; M-l toS-434; M-l to V-433; M-l to V-432; M-l 
to S-431; M-l to T-430; M-l to V-429; M-l to T-428; M-l to Y-427;M-1 to G-426; M-l 
to L-425; M-l to Y-424; M-l to A-423; M-l to G-422; M-l to H-421; M-l to N-420; M- 
ltoK-419; M-l toL-418; M-l toE-417;M-l toE-416; M-l toP-415;M-l toF-414; M- 
1 to E-413; M-l toK-412; M-l to L-41 1 ; M-l to Y-410; M-l to S-409; M-l to E-408; M- 
1 to R-407; M-l to L-406; M-l to P-405;M-1 to 1-404; M-l to V-403; M-l to K-402; M- 
1 to G-401; M-l to A-400; M-l to S-399; M-l to T-398; M-l toE-397; M-l to K-396; 
M-l to L-395; M-l to V-394; M-l to A-393; M-l to G-392; M-l to N-391; M-l toW- 
390; M-l to D-389; M-l to Y-388; M-l to A-387; M-l to G-386; M-l to V-385; M-l to 
A-384; M-l toG-383; M-l to L-382; M-l to L-381; M-l to V-380; M-l to G-379; M-l to 
D-378; M-l to E-377; M-l toV-376; M-l to V-375; M-l to H-374; M-l to S-373; M-l to 
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S-372; M-l to F-371; M-l to G-370; M-l toT-369; M-l to Q-368; M-l to S-367; M-l to 
M-366; M-l to E-365; M-l to L-364; M-l to G-363; M-l toF-362; M-l to S-361; M-l to 
T-360; M-l to E-359; M-l to N-358; M-l to K-357; M-l to N-356; M-l toT-355; M-l to 
G-354; M-l to E-353; M-l to L-352; M-l to S-351; M-l to F-350; M-l to 1-349; M-l to 
5 R-348;M-1 to D-347; M-l to G-346; M-l to L-345; M-l to A-344; M-l to D-343; M-l to 
V-342; M-l to 1-341; M-l toD-340; M-l to K-339; M-l to L-338; M-l to A-337; M-l to 
A-336; M-l to E-335; M-l toD-334; M-l toT-333; M-l to V-332; M-l to N-331; M-l to 
F-330; M-l to F-329: M-l to H-328; M-l to K-327; M-l toD-326; M-l to D-325; M-l to 
P-324; M-l to D-323; M-l to S-322; M-l to A-321; M-l to 1-320; M-l to Y-319:M-1 to 
10 K-318;M-1 to 1-317; M-l toE-316;M-l toN-3l5;M-l toL-314;M-l toF-313;M-l to 
T-312; M-l toE-311;M-l toP-310; M-l to N-309; M-l to 1-308; M-l toG-307;M-l to 
R-306; M-l to R-305; M-l toN-304; M-l to Y-303; M-l to Y-302; M-l to G-301; M-l 
to L-300; M-l to V-299; M-l to A-298; M-l toV-297; M-l to A-296; M-l to Y-295; M- 
1 to R-294; M-l to T-293; M-l to V-292; M-l to N-291; M-l toD-290; M-l to R-289; 

15 M-l to E-288; M-l to S-287; M-l to Q-286; M-l to Q-285; M-l to 1-284; M-l to V- 
283;M-1 to K-282; M-l to E-281; M-l to L-280; M-l to D-279; M-l to P-278; M-l to S- 
277; M-l to D-276; M-l toH-275; M-l to S-274; M-l to E-273; M-l to G-272; M-l to 
D-271; M-l to T-270; M-l to 1-269; M-l to V-268;M-1 to 1-267; M-l to M-266; M-l to 
V-265; M-l to K-264; M-l to K-263; M-l to A-262; M-l to G-261; M-lto K-260; M-l 

20 to R-259; M-l to G-258; M-l to G-257; M-l to K-256; M-l to Q-255; M-l to F-254; M- 
1 toA-253; M-l to E-252; M-l to S-251; M-l to R-250; M-l to A-249; M-l to F-248; M- 
1 to E-247; M-l to I-246;M-1 to G-245; M-l to F-244; M-l to A-243; M-l to T-242; M- 
1 to R-241; M-l to T-240; M-l to E-239; M-l toT-238; M-l to G-237; M-l to G-236; 
M-l to R-235; M-l to Q-234; M-l to E-233; M-l to 1-232; M-l toH-231; M-l to S-230; 

25 M-l to A-229; M-l to A-228; M-l to E-227; M-l to V-226; M-l to V-225; M-l toD- 
224; M-l to K-223; M-l to V-222; M-l to S-22I; M-l to R-220; M-l to Y-219; M-l to 
D-218; M-l toN-217; M-l to L-216; M-l to H-215; M-l to F-214; M-l to E-213; M-l to 
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H-212; M-l to V-21 1; M-l toV-210; M-l to D-209; M-l to E-208; M-l to G-207; M-l 
to Y-206; M-l to Q-205; M-l to V-204; M-l toV-203; M-l to G-202; M-l to V-201 ; M- 
1 to Q-200; M-l to 1-199; M-l to Q-198; M-l to G-197; M-l toP-196; M-l to G-195; M- 
1 to 1-194; M-l to Y-193; M-l to F-192; M-l to K-191; M-l to K-190; M-l to L-189;M- 
5 1 to 1-188; M-l toN-187;M-l to 1-186; M-l toL-185;M-l toF-184;M-l toH-183;M- 
I toQ-182; M-l toV-181;M-l toE-180; M-ltoV-179; M-l toW-178;M-l toP-177; 
M-l to Y-176; M-l to 1-175; M-l to S-174;M-1 to N-173; M-l to S-172; M-l to G-171; 
M-l to D-170; M-l to L-169; M-l to V-168; M-l to 1-167; M-l toV-166; M-l to 1-165; 
M-l to D-164; M-l to M-163; M-l to Y-162; M-l to T-161; M-l to Q-160; M-l toC- 

10 159; M-l to R-158; M-l to Q-157; M-l to L-156; M-l to A-155; M-l to P-154; M-l to 
A-153;M-1 toV-152; M-l toT-151;M-l toK-150; M-l to S-149; M-l to F-148; M-l to 
R-147; M-l to F-146; M-l toN-145; M-l to S-144; M-l to N-143; M-l to V-142; M-l to 
R-I41; M-l to S-140; M-l to C-139; M-l toM-138; M-l to G-137; M-l toT-136; M-l to 
T-135; M-l to Y-134; M-l to Y-133; M-l to S-132; M-l toS-131; M-l to G-130; M-l to 

15 C-129; M-I to E-128; M-l to H-127; M-l to S-126; M-l to W-125; M-l toL-124; M-l to 
P-123; M-l to S-122; M-l to C-121; M-l to A-120; M-l to L-l 19; M-l to F-118; M-l to 
S-117;M-1 toN-116; M-l toD-115;M-l to K-114; M-l to P-113; M-l toN-112; M-l to 
T-l 11; M-l to A-110; M-lto L-109; M-l to S-108; M-l to L-107; M-l to G-106; M-l to 
L-105; M-l toR-104;M-l toM-103; M-l toN-102;M-l toD-101; M-l toK-100;M-l 

20 to R-99; M-l to E-98; M-l to S-97; M-l to V-96; M-l to N-95; M-lto S-94; M-l to L- 
93; M-l to T-92; M-l to V-91; M-l to R-90; M-l to G-89; M-l to L-88; M-l to N-87; 
M-lto L-86; M-l to K-85; M-l to T-84; M-l to C-83; M-l to N-82; M-l to G-81 ; M-l to 
H-80; M-l to 1-79; M-lto V-78; M-l to P-77; M-l to C-76; M-l to K-75; M-l to Y-74; 
M-l to V-73; M-l to D-72; M-l to G-71; M-lto T-70; M-l to K-69; M-l to Q-68; M-l 

25 to Y-67; M- 1 to G-66; M-l to N-65; M-l to T-64; M-l to E-63; M-lto L-62; M-l to P- 
61; M-l to A-60; M-l to G-59; M-l to V-58; M-l to V-57; M-l to L-56; M-l to W-55; 
M-lto K-54; M-l to N-53; M-l to G-52; M-l to S-51; M-l to 1-50; M-l to D-49; M-l to 
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H-48; M-l to Q-47; M-lto Q-46; M-l to V-45; M-l to T-44; M-l to Y-43; M-l to G-42; 
M-i to F-41; M-l to F-40; M-l to A-39; M-lto T-38; M-l to R-37; M-l to S-36; M-l to 
G-35; M-l to P-34; M-l to 1-33; M-l to V-32; M-l to R-31; M-l toP-30; M-l to K-29; 
M-l to R-28; M-l to T-27; M-i to D-26; M-l to M-25; M-l to N-24; M-l to F-23; M-l 
toT-22; M-l to D-21; M-l to T-20; M-l to F-19; M-l to G-18; M-l to P-17; M-l to W- 
16; M-l toL-15;M-l toS-14; M-l toL-13;M-l to A-12;M-1 to W-1I;M-1 to A-10; M- 
1 to V-9; M-J to V-8; M-l to L-7; M-l to G-6; of SEQ ID NO:35. Polypeptides encoded 
by these polynucleotides are also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 17 which have been determined 
from the following related cDNA genes: HEEAB54R (SEQ ID NO: 104), HRDAF83R 
(SEQ ID NO: 105), HOUBC62R (SEQ ID NO:106), HCDBI19R (SEQ ID NO: 107), 
HOHCU94R (SEQ ID NO:108), HOACC13R (SEQ ID NO:109), HCDAP21R (SEQ ID 
NO: 110), HNHHA34R (SEQ ID NO:l 1 1), HOHEA75R (SEQ ID NO:l 12) and 
HNGEL59R (SEQ ID NO:l 13). 

Based on the sequence similarity to the human integrin alpha 1 subunit, 
translation product of this gene is expected to share at least some biological activities 
with integrin proteins, and specifically the integrin alpha 1 protein. Such activities are 
known in the art, some of which are described elsewhere herein. 

Specifically, polynucleotides and polypeptides of the invention are also useful for 
modulating the differentiation of normal and malignant cells, modulating the 
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the 
immune response. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for hematopoietic and immune diseases and/or disorders. The full- 
length protein should be a secreted protein, based upon homology to the integrin family. 
Therefore, it is secreted into serum, urine, or feces and thus the levels is assayable from 
patient samples. Assuming specific expression levels are reflective of the presence of 
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immune disorders, this protein would provide a convenient diagnostic for early detection. 
In addition, expression of this gene product may also be linked to the progression of 
immune diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of cancer. 
5 Polynucleotides and polypeptides of the invention may play an important role in 

the pathogenesis of human cancers and cellular transformation, particularly those of the 
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention 
may also be involved in the pathogenesis of developmental abnormalities based upon its 
potential effects on proliferation and differentiation of cells and tissue cell types. Due to 

10 the potential proliferating and differentiating activity of said polynucleotides and 
polypeptides, the invention is useful as a therapeutic agent in inducing tissue 
regeneration, for treating inflammatory conditions (e.g., inflammatory bowel syndrome, 
diverticulitis, etc.). Moreover, the invention is useful in modulating the immune 
response to aberrant polypeptides, as may exist in rapidly proliferating cells and tissue 

15 cell types, particularly in adenocarcinoma cells, and other cancers. 

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 

20 described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 

25 death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 
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Alternatively, this gene product is involved in the pattern of cellular proliferation 
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in 
tissues - particularly adult tissues - may correlate with patterns of abnormal cellular 
proliferation, such as found in various cancers. Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control cell 
and tissue type specification. Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus 
this protein may modulate apoptosis or tissue differentiation and is useful in the 
detection, treatment, and/or prevention of degenerative or proliferative conditions and 
diseases. The protein is useful in modulating the immune response to aberrant 
polypeptides, as may exist in proliferating and cancerous cells and tissues. The protein 
can also be used to gain new insight into the regulation of cellular growth and 
proliferation. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

This gene is expressed almost exclusively in osteoblasts, human trabecular bone 
cells, messangial cells, adipocytes, and to a lesser extent in osteosarcoma, 
chondrosarcoma, breast cancer cells, and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of the following diseases and conditions which 
include, but are not limited to, disorders of the skeletal system, connective tissues, and 
immune and hematpoietic diseases and/or disorders. Similarly, polypeptides and 
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antibodies directed to these polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of the 
above tissues or cells, particularly of the connective tissue and skeletal system, 
expression of this gene at significantly higher or lower levels is detected in certain 
5 tissues or cell types (e.g. immune, hematopoietic, skeletal, bone, cartilage, 

develpomental, reproductive, secretory, and cancerous and wounded tissues) or bodily 
fluids or cell types (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue from an 

10 individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 35 as residues: Phe-23 to Arg-31, Leu-62 to Asp-72, Vai-96 to 
Asp-101, Thr-111 to Asn-116, Glu-128 toThr-135, Val-142 to Ser-149, Asn-217 to Val- 
222, Glu-233 to Arg-241, Gly-272 to Leu-280, Gln-286 to Thr-293, Tyr-303 to IIe-308, 

15 Gly-354 to Thr-360, Glu-408 to Lys-419, GIu-508 to Lys-514, Arg-521 to Val-526, Gly- 
529 to Phe-542, Asp-551 to Tyr-557, Thr-587 to Thr-593, His-656 to Asp-665, Met-697 
to Arg-705, Asp-709 to Thr-716, Glu-755 to Gly-760, Asn-779 to His-786, Leu-810 to 
Asp-816, Leu-844 to Ala-851, GIn-871 to Gly-877, Glu-884 to GIn-889, Ser-931 to Asn- 
943, Ser-974 to Ile-982, Gly-1039 to Gln-1058, Arg-1 121 to Arg-1127, Ser-1134 to Trp- 

20 1 139, Ser-1 172 to Pro-1 183. Polynucleotides encoding said polypeptides are also 
provided. 

The tissue distribution in osteoblasts and homology to integrin alpha subunit 10 
indicates that the protein products of this gene are useful for the treatment of disorders 
and conditions affecting the skeletal system, in particular osteoporosis as well as 
25 disorders afflicting connective tissues (e.g. arthritis, trauma, tendonitis, chrondomalacia 
and inflammation), such as in the diagnosis and treatment of various autoimmune 
disorders such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

133 

as dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (ie. spondyloepiphyseal dysplasia congenita, familial osteoarthritis, 
Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). polynucleotides 
and polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia since stromal cells are important in the production of cells 
of hematopoietic lineages. Such a use is consistent with the observed homology to 
integrin family members, in conjunction with The tissue distribution in bone 
marrow cells. Integrins play pivotal roles in cell migration, inflammation, proliferation, 
and cellular infiltration. Thus, the present invention is expected to share at least some of 
these activities. Representative uses are described in the "Immune Activity" and 
"infectious disease" sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and 
elsewhere herein. Briefly, the uses include bone marrow cell ex-vivo culture, bone 
marrow transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of stem 
cells and committed progenitors of various blood lineages, and in the differentiation 
and/or proliferation of various cell types. Based upon the tissue distribution of this 
protein, antagonists directed against this protein is useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
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serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
5 used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. Many polynucleotide sequences, such as EST sequences, are 

10 publicly available and accessible through sequence databases. Some of these sequences 
are related to SEQ ID NO: 17 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 

15 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 4981 of SEQ ID NO:17, b is an 
integer of 15 to 4995, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 17, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

The present invention relates to three novel peptidoglycan recognition binding 
proteins expressed by keratinocytes, wound-healing tissues and chondrosarcoma tissue. 
25 More specifically, isolated nucleic acid molecules are provided encoding a human 
peptidoglycan recognition protein-related protein, sometimes referred to herein as 
"human tag7" or M tag7" or "htag7". Further provided are vectors, host cells and 
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recombinant methods for producing the same. The invention also relates to both the 
inhibition and enhancement of activities of the tag7 protein, polypeptides and diagnostic 
methods for detecting tag7 gene expression. 

Peptidoglycan, as well as Lipopolysaccharide (LPS), is a surface component of 
5 many bacteria which illicit a wide range of physiological and immune responses in 
humans. Specifically, peptidoglycan has been shown to manifest itself clinically by 
reproducing most of the symptoms of bacterial infection, including fever, acute-phase 
response, inflammation, septic shock, leukocytosis, sleepiness, malaise, abcess 
formation, and arthritis (see Dziarski et al., JBC, 273 (15): 8680 (1998)). Furthermore, 

10 the type of peptidoglycan (i.e.- the specific stereoisomers or analogs of muramyl 

dipeptide, N-acetylglucosaminyl-beta(l-4)-N-acteylmuramyl tetrapeptides, etc.), were 
shown to elicit a broad range of activities, including exhibiting greater pyrogenicity, 
inducing acute joint inflammation, stimulating macrophages, and causing hemorrhagic 
necrosis at a primed site (See Kotani et al., Fed Proc, 45(1 1): 2534 (1986)), It has been 

15 demonstrated in humans that a lipopolysaccharide binding protein exists that was 
discovered as a trace plasma protein (See Schumann et al„ Science, 249(4975): 1429 
(1990)). It is thought that one of the modes of action by which this lipopolysaccharide 
binding protein functions is by forming high-affinity complexes with lipopolysaccharide, 
that then bind to macrophages and monocytes, inducing the secretion of tumor necrosis 

20 factor. Dziarski and Gupta (See Dziarski et al., JBC, 269(3): 2100 (1994)) demonstrated 
that a 70kDa receptor protein present on the surface of mouse lymphocytes served to 
bind heparin, heparinoids, bacterial lipoteichoic acids, peptidoglycan, and 
lipopolysaccharides. Recently, Dziarski et al. demonstrated that the CD14, a 
glycosylphosphatidylinositol-linked protein present on the surface of macrophage and 

25 polymorphonuclear leukocytes, bound peptidoglycan and lipopolysaccharide. 

Furthermore, the binding affinity of CD 14 for lipopolysaccharide was 
significantly increased in the presence of a LPS-binding protein present in plasma. It is 
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thought that the LPS-binding protein functions as a transfer molecule, whereby it binds 
LPS and presents it to the CD14 receptor (See Dziarski et al., JBC, 273(15): 8680 
(1998)). Yoshida et al. isolated a peptidoglycan binding protein from the hemolymph of 
the Silkworm, Bombyx mori, using column chromatography. This protein was found to 
5 have a very specific affinity for peptidoglycan (See Yoshida et al., JBC, 271(23): 13854 
(1996)). 

Additionally, Kang et al. recently cloned a peptidoglycan binding protein from 
the moth Trichoplusia ni. The peptidoglycan binding protein was shown to bind strongly 
to insoluble peptidoglycan (See Kang etal., PNAS, 95(17): 10078 (1998)). In this study 

10 the peptidoglycan binding protein was upregulated by a bacterial infection in T. ni. The 
insect immune system is regarded as a model for innate immunity. Thus, Kang et al were 
able to gene both mouse and human homologs of the T. ni peptidoglycan binding protein. 
All of these peptidoglycan binding proteins shared regions of homology, as well as four 
conserved cysteine residues which may function in the tertiary structure of the protein, 

15 possibly in helping to form binding domains. Given that peptidoglycan is an integral 

component of bacterial cell walls, and that it induces many physiological responses from 
cytokine secretion to inflammation and macrophage activation, it appears as if this family 
of proteins is a ubiquitous group involved in the binding and recognition of 
peptidoglycan, the presentation of antigens (e.g., cell wall components, etc.), and the 

20 activation of the immune system, such as the secretion of cytokines, such as TNF. TNF is 
noted for its pro-inflammatory actions which result in tissue injury, such as induction of 
procoagulant activity on vascular endothelial cells (Pober, J.S. et al., J. Immunol. 
136:1680 (1986)), increased adherence of neutrophils and lymphocytes (Pober, J.S. et al., 
J. Immunol. 138:3319 (1987)), and stimulation of the release of platelet activating factor 

25 from macrophages, neutrophils and vascular endothelial cells (Camussi, G. et al., J. Exp. 
Med. 166:1390(1987)). 
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Recent evidence implicates TNF in the pathogenesis of many infections (Cerami, 
A. et al., Immunol. Today 9:28 (1988)), immune disorders, neoplastic pathology, e.g., in 
cachexia accompanying some malignancies (Oliff, A. et al., Cell 50:555 (1987)), and in 
autoimmune pathologies and graft-versus host pathology (Piguet, P.-F. et al., J. Exp. 
5 Med. 166:1280 (1987)). The association of TNF with cancer and infectious pathologies 
is often related to the host's catabolic state. A major problem in cancer patients is weight 
loss, usually associated with anorexia. The extensive wasting which results is known as 
"cachexia" (Kern, K. A. et al. J. Parent. Enter. Nutr. 12:286-298 (1988)). Cachexia 
includes progressive weight loss, anorexia, and persistent erosion of body mass in 

10 response to a malignant growth. The cachectic state is thus associated with significant 
morbidity and is responsible for the majority of cancer mortality. 

A number of studies have suggested that TNF is an important mediator of the 
cachexia in cancer, infectious pathology, and in other catabolic states. TNF is thought to 
play a central role in the pathophysiological consequences of Gram-negative sepsis and 

15 endotoxic shock (Michie, H.R. et al., Br. J. Surg. 76:670-671 (1989); Debets, J. M. H. et 
al., Second Vienna Shock Forum, p.463-466 (1989); Simpson, S. Q. et al., Crit. Care 
Clin. 5:27-47 (1989)), including fever, malaise, anorexia, and cachexia. Endotoxin is a 
potent monocyte/macrophage activator which stimulates production and secretion of 
TNF (Kornbluth, S.K. et al., J. Immunol. 137:2585-2591 (1986)) and other cytokines. 

20 Because TNF could mimic many biological effects of endotoxin, it was concluded to be a 
central mediator responsible for the clinical manifestations of endotoxin-related illness. 
TNF and other monocyte-derived cytokines mediate the metabolic and neurohormonal 
responses to endotoxin (Michie, H.R. et al., N. Eng. J. Med. 318:1481-1486 (1988)). 
Endotoxin administration to human volunteers produces acute illness with flu-like 

25 symptoms including fever, tachycardia, increased metabolic rate and stress hormone 
release (Revhaug, A. et al., Arch. Surg. 123:162-170 (1988)). Elevated levels of 
circulating TNF have also been found in patients suffering from Gram-negative sepsis 
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(Waage, A. et al., Lancet 1:355-357 (1987); Hammerle, A.F. et al., Second Vienna Shock 
Forum p. 715-718 (1989); Debets, J. M. H. et al., Crit. Care Med. 17:489-497 (1989); 
Calandra, T. et al., J. Infec. Dis. 161:982-987 (1990)). Passive immunotherapy directed 
at neutralizing TNF may have a beneficial effect in Gram-negative sepsis and 
5 endotoxemia, based on the increased TNF production and elevated TNF levels in these 
pathology states, as discussed above. 

Antibodies to a "modulator" material which was characterized as cachectin (later 
found to be identical to TNF) were disclosed by Cerami et al. (EPO Patent Publication 
0,212,489, March 4, 1987). Such antibodies were said to be useful in diagnostic 

10 immunoassays and in therapy of shock in bacterial infections. Rubin et al. (EPO Patent 
Publication 0,218,868, April 22, 1987) disclosed monoclonal antibodies to human TNF, 
the hybridomas secreting such antibodies, methods of producing such antibodies, and the 
use of such antibodies in immunoassay of TNF. Yone et al. (EPO Patent Publication 
0,288,088, October 26, 1988) disclosed anti-TNF antibodies, including mAbs, and their 

15 utility in immunoassay diagnosis of pathologies, in particular Kawasaki's pathology and 
bacterial infection. The body fluids of patients with Kawasaki's pathology (infantile acute 
febrile mucocutaneous lymph node syndrome; Kawasaki, T., Allergy 16:178 (1967); 
Kawasaki, T., Shonica (Pediatrics) 26:935 (1985)) were said to contain elevated TNF 
levels which were related to progress of the pathology (Yone et al., supra). 

20 Accordingly, there is a need to provide molecules that are involved in 

pathological conditions. Such novel proteins could be useful in augmenting the immune 
system in such areas as immune recognition, antigen presentation, and immune system 
activation. Antibodies or antagonists directed against these proteins is useful in reducing 
or eliminating disorders associated with TNF and TNF-like cytokines, such as endotoxic 

25 shock and auto-immune disorders, for example. 

The polypeptide of the present invention has been putatively identified as a 
member of the novel peptidoglycan recognition binding protein family and has been 
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termed human tag7. This identification has been made as a result of amino acid sequence 
homology to the mouse tag7 (See Genbank Accession No. emb|CAA60133). 

Figure 34 shows the nucleotide (SEQ ID NO: 18) and deduced amino acid 
sequence (SEQ ID NO:36) of htag7. Predicted amino acids from about 1 to about 21 
5 constitute the predicted signal peptide (amino acid residues from about 1 to about 21 in 
SEQ ID NO:36) and are represented by the underlined amino acid regions; and amino 
acids from about 34 to about 117 constitute the predicted PGRP-like domain (amino 
acids from about 34 to about 1 17 in SEQ ID NO:36) and are represented by the double 
underlined amino acids. 

10 Figure 35 shows the regions of similarity between the amino acid sequences of 

the htag7 protein (SEQ ID NO:36) and the mouse tag7 protein (SEQ ID NO: 1 14). 

Figure 36 shows an analysis of the htag7 amino acid sequence. Alpha, beta, turn 
and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible 
regions; antigenic index and surface probability are shown. 

15 A polynucleotide encoding a polypeptide of the present invention is obtained 

from human chondrosarcoma cells, bone marrow, and neutrophils. The polynucleotide of 
this invention was discovered in a human chondrosarcoma cDNA library. 

As shown in Figure 34, htag7 has a PGRP domain (the PGRP domain comprise 
amino acids from about 34 to about 1 17 of SEQ ID NO:36; which correspond to amino 

20 acids from about 34 to about 1 17 of Figure 34). The polynucleotide contains an open 
reading frame encoding the htag7 polypeptide of 198 amino acids. htag7 exhibits a high 
degree of homology at the amino acid level to the mouse tag7 (as shown in Figure 35). 
The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the htag7 polypeptide having the amino acid sequence shown in 

25 Figure 34 (SEQ ID NO:36). The nucleotide sequence shown in Figure 34 (SEQ ID 

NO: 18) was obtained by sequencing a cloned cDNA (HCDDP40), which was deposited 
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on November 17 at the American Type Culture Collection, and given Accession Number 
203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
5 nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 18 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 

10 invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 18. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 18. In this context "about" includes the particularly recited size, 

15 larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of htag7 polynucleotide fragments of the invention 
include, for example, fragments that comprise, or alternatively, consist of, a sequence 
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to 
about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to 

20 about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to 
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to 
about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to 
about 726, and from about 130 to about 379 of SEQ ID NO:18, or the complementary 
strand thereto, or the cDNA contained in the deposited gene. In this context "about" 

25 includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1 ) 
nucleotides, at either terminus or at both termini. 
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Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 
alternatively, consisting of, the PGRP-like domain (amino acid residues from about 34 to 
about 1 17 in Figure 34 (amino acids from about 34 to about 1 17 in SEQ ID NO:36). 
5 Since the location of these domains have been predicted by computer analysis, one of 
ordinary skill would appreciate that the amino acid residues constituting these domains 
may vary slightly (e.g., by about 1 to 15 amino acid residues) depending on the criteria 
used to define each domain. As indicated, nucleic acid molecules of the present invention 
which encode a htag7 polypeptide may include, but are not limited to those encoding the 

10 amino acid sequence of the PGRP-like domain of the polypeptide, by itself; and the 

coding sequence for the PGRP-like domain of the polypeptide and additional sequences, 
such as a pre-, or pro or prepro- protein sequence. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of htag7. 

Preferred embodiments of the invention in this regard include fragments that 

15 comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), tum and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of htag7. The data 

20 representing the structural or functional attributes of htag7 set forth in Figure 36 and/or 
Table XII, as described above, was generated using the various modules and algorithms 
of the DNA*STAR set on default parameters. In a preferred embodiment, the data 
presented in columns VIII, IX, XIII, and XIV of Table XII can be used to determine 
regions of htag7 which exhibit a high degree of potential for antigenicity. Regions of 

25 high antigenicity are determined from the data presented in columns VIII, IX, XIII, 
and/or XIV by choosing values which represent regions of the polypeptide which are 
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likely to be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 36, but may, as 
shown in Table XII, be represented or identified by using tabular representations of the 
5 data presented in Figure 36. The DNA*STAR computer algorithm used to generate 
Figure 36 (set on the original default parameters) was used to present the data in Figure 
36 in a tabular format (See Table XII). The tabular format of the data in Figure 36 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 36 and in Table XII include, but are not limited to, 

10 regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figure 34. As set out in Figure 36 and in Table Xll, such preferred regions include 
Gamier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou- 
Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions 
and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, 

1 5 Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index and 

Emini surface-forming regions. Even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, modulate cellular interaction, or signalling pathways, etc.) may still be 

20 retained. For example, the ability of shortened htag7 muteins to induce and/or bind to 
antibodies which recognize the complete or mature forms of the polypeptides generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the N-terminus. Whether a particular polypeptide lacking 
N-terminal residues of a complete polypeptide retains such immunologic activities can 

25 readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that an htag7 mutein with a large number of deleted N-terminal 
amino acid residues may retain some biological or immunogenic activities. In fact, 
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peptides composed of as few as six htag7 amino acid residues may often evoke an 
immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the htag7 amino acid sequence shown 
in Figure 34, up to the proline residue at position number 191 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n 1 - 1 96 of Figure 34 , where n 1 is an 
integer from 2 to 191 corresponding to the position of the amino acid residue in Figure 
34 (which is identical to the sequence shown as SEQ ID NO:36). In another embodiment, 
N-terminal deletions of the htag7 polypeptide can be described by the general formula 
n2-196, where n2 is a number from 2 to 191, corresponding to the position of amino acid 
identified in Figure 34. N-terminal deletions of the htag7 polypeptide of the invention 
shown as SEQ ID NO:36 include polypeptides comprising the amino acid sequence of 
residues: N-terminal deletions of the htag7 polypeptide of the invention shown as SEQ 
ID NO:36 include polypeptides comprising the amino acid sequence of residues: S-2 to 
P-196; R-3 to P-196; R-4 to P-196; S-5 to P-196; M-6 to P-196; L-7 to P-196; L-8 to P- 
196; A-9 to P-196; W-10 to P-196; A-l 1 to P-196; L-12 to P-196; P-13 to P-196; S-14 to 
P-196; L-I5 to P-196; L-16 to P-196; R-17 to P-196; L-18 to P-196; G-19 to P-196; A-20 
to P-196; A-21 to P-196; Q-22 to P-196; E-23 to P-196; T-24 to P-196; E-25 to P-196; 
D-26 to P-196; P-27 to P-196; A-28 to P-196; C-29 to P-196; C-30 to P-196; S-31 to P- 
196; P-32 to P-196; 1-33 to P-196; V-34 to P-196; P-35 to P-196; R-36 to P-196; N-37 to 
P-196; E-38 to P-196; W-39 to P-196; K-40 to P-196; A-41 to P-196; L-42 to P-196; A- 
43 to P-196; S-44 to P-196; E-45 to P-196; C-46 to P-196; A-47 to P-196; Q-48 to P- 
196; H-49 to P-196; L-50 to P-196; S-51 to P-196; L-52 to P-196; P-53 to P-196; L-54 to 
P-196; R-55 to P-196; Y-56 to P-196; V-57 to P-196; V-58 to P-196; V-59 to P-196; S- 
60 to P-196; H-61 to P-196; T-62 to P-196; A-63 to P-196; G-64 to P-196; S-65 to P- 
196; S-66 to P-196; C-67 to P-196; N-68 to P-196; T-69 to P-196; P-70 to P-196; A-7I 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

144 

to P-196; S-72 to P-196; C-73 to P-196; Q-74 to P-196; Q-75 to P-196; Q-76 to P-196; 
A-77 to P-196; R-78 to P-196; N-79 to P-196; V-80 to P-196; Q-81 to P-196; H-82 to P- 
196; Y-83 to P-196; H-84 to P-196; M-85 to P-196; K-86 to P-196; T-87 to P-196; L-88 
to P-196; G-89 to P-196; W-90 to P-196; C-91 to P-196; D-92 to P-196; V-93 to P-196; 
5 G-94 to P-196; Y-95 to P-196; N-96 to P-196; F-97 to P-196; L-98 to P-196; 1-99 to P- 
196; G-I0O to P-196; E-101 to P-196; D-102 to P-196; G-103 to P-196; L-104 to P-196; 
V-105 to P-196; Y-106 to P-196; E-107 to P-196; G-108 to P-196; R-109 to P-196; G- 
1 10 to P-196; W-lll to P-196; N-112 to P-196; F-l 13 to P-196; T-l 14 to P-196; G-l 15 
to P-196; A-l 16 to P-196; H-l 17 to P-196; S-l 18 to P-196; G-l 19 to P-196; H-120 to P- 

10 196: L-121 to P-196; W-122 to P-196; N-123 to P-196; P-124 to P-196; M-125 to P-196; 
S-126 to P-196; 1-127 to P-196; G-128 to P-196; 1-129 to P-196; S-130 to P-196; F-131 
to P-196; M-132 to P-196; G-133 to P-196; N-134 to P-196; Y-135 to P-196; M-136 to 
P-196; D-137 to P-196; R-138 to P-196; V-139 to P-196; P-140 to P-196; T-141 to P- 
196; P-142 to P-196; Q-143 to P-196; A-144 to P-196; 1-145 to P-196; R-146 to P-196; 

15 A- 147 to P-196; A- 148 to P-196; Q-149 to P-196; G-150 to P-196; L-151 to P-196; L- 
152 to P-196; A-153 to P-196; C-154 to P-196; G-155 to P-196; V-156 to P-196; A-157 
to P-196; Q-158 to P-196; G-159 to P-196; A-160 to P-196; L-161 to P-196; R-162 to P- 
196; S-163 to P-196; N-164 to P-196; Y-165 to P-196; V-166 to P-196: L-167 to P-196; 
K-168 to P-196; G-169 to P-196; H-170 to P-196; R-171 to P-196; D-172 to P-196; V- 

20 173 to P-196; Q-174 to P-196; R-175 to P-196; T-176 to P-196; L-177 to P-196; S-178 to 
P-196; P-179 to P-196; G-180 to P-196; N-181 to P-196; Q-182 to P-196; L-183 to P- 
196; Y-184 to P-196; H-185 to P-196; L-186 to P-196; 1-187 to P-196; Q-188 to P-196; 
N-189 to P-196; W-190 to P-196; P-191 to P-196; of SEQ ID NO:36. Polypeptides 
encoded by these polynucleotides are also encompassed by the invention. 

25 Also as mentioned above, even if deletion of one or more amino acids from the 

C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities ) may still 
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be retained. For example the ability of the shortened htag7 mutein to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a htag7 mutein with a large number of deleted C-terminal 
amino acid residues may retain some biological or immunogenic activities. In fact, 
peptides composed of as few as six htag7 amino acid residues may often evoke an 
immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the htag7 
polypeptide shown in Figure 34 , up to the methionine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-ml of Figure 1, where 
ml is an integer from 6 to 196 corresponding to the position of the amino acid residue in 
Figure 34 . Moreover, the invention provides polynucleotides encoding polypeptides 
comprising, or alternatively consisting of, the amino acid sequence of C-terminal 
deletions of the htag7 polypeptide of the invention shown as SEQ ID NO:36 include 
polypeptides comprising the amino acid sequence of residues: M-l to S-195; M-I to R- 
194; M-l to Y-I93;M-1 to H-192; M-l to P-I91;M-1 to W-190; M-l toN-189; M-l to 
Q-188; M-l to 1-187; M-l to L-186; M-l to H-185; M-l to Y-184; M-l to L-183; M-l to 
Q-182; M-l to N-181; M-l to G-180; M-l to P-179; M-l to S-178; M-l to L-177; M-l to 
T-176; M-l to R-175; M-l to Q-174; M-l to V-173; M-l to D-172; M-l to R-I7I; M-l 
to H-170; M-l to G-169; M-l to K-168; M-l to L-167; M-l to V-166; M-l to Y-165; M- 
1 to N-164; M-l to S-163; M-l to R-162; M-l to L-161; M-l to A-160; M-l to G-159; 
M-l to Q-158; M-l to A-157; M-l to V-156; M-l to G-J55; M-l to C-154; M-l to A- 
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153; M-l to L-152; M-l to L-151; M-l to G-150; M-l to Q-149; M-l to A-148; M-l to 
A-147; M-l to R-146; M-l to 1-145; M-l to A-144; M-l to Q-143; M-l to P-142; M-l to 
T-141;M-1 toP-140; M-l to V-139; M-l toR-138;M-l toD-137;M-l toM-136;M-l 
to Y-135; M-l to N-134; M-l to G-133; M-l to M-132; M-l to F-131; M-l to S-130; M- 
5 1 to 1-129; M-l to G-128; M-l to 1-127; M-l to S-126; M-l to M-125; M-l to P-124; M- 
1 toN-123; M-l toW-122; M-l toL-121;M-l toH-120; M-l toG-119; M-l toS-118; 
M-l to H-117;M-1 toA-116; M-l toG-115; M-l toT-114;M-l toF-H3;M-l to N- 
112; M-l to W-111;M-1 toG-110; M-l toR-109; M-l toG-108; M-l toE-107; M-l to 
Y-106; M-l to V-105;M-1 toL-104; M-l to G-103;M-1 toD-102; M-l toE-101; M-l 

10 to G-100; M-l to 1-99; M-l to L-98; M-l to F-97; M-l to N-96; M-l to Y-95; M-l to G- 
94; M-l to V-93; M-l toD-92; M-l toC-91; M-l toW-90; M-l toG-89;M-l toL-88; 
M-l to T-87; M-l to K-86: M-l to M-85; M-l to H-84; M-l to Y-83; M-l to H-82; M-l 
to Q-81; M-l to V-80; M-l to N-79; M-l to R-78; M-l to A-77; M-l to Q-76; M-l to Q- 
75; M-l to Q-74; M-l to C-73; M-l to S-72; M-l to A-71; M-l to P-70; M-l to T-69; M- 

15 1 to N-68; M-l to C-67; M-l to S-66; M-l to S-65; M-l to G-64; M-l to A-63; M-l to T- 
62; M-l to H-61; M-l to S-60: M-l to V-59; M-l to V-58; M-l to V-57; M-l to Y-56; 
M-l to R-55; M-l to L-54; M-l to P-53; M-l to L-52; M-l to S-51; M-l to L-50; M-l to 
H-49; M-l to Q-48; M-l to A-47; M-l to C-46; M-l to E-45; M-l to S-44; M-l to A-43; 
M-l to L-42; M-l to A-41; M-l to K-40; M-l to W-39; M-l to E-38; M-l to N-37; M-l 

20 to R-36; M-l to P-35; M-l to V-34; M-l to 1-33; M-l to P-32; M-l to S-31; M-l to C- 
30; M-l to C-29; M-l to A-28; M-l to P-27; M-l to D-26; M-l to E-25; M-l to T-24; M- 
1 to E-23; M-l to Q-22; M-l to A-21; M-l to A-20; M-l to G-19; M-l to L-18; M-l to 
R-17; M-l to L-16; M-l to L-15; M-l to S-14; M-l to P-13; M-l to L-12; M-l to A-l 1; 
M-l to W-10; M-l to A-9; M-l to L-8; M-l to L-7; M-l to M-6; of SEQ ID NO:36. 

25 Polypeptides encoded by these polynucleotides are also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:36 which have been determined 
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from the following related cDNA genes: HBMTB79R (SEQ ID NO:l 15) and 
HCDDP40R (SEQ ID NO: 1 16). 

Based on the sequence similarity to the mouse tag7 and the PGRP-like domain, 
translation product of this gene is expected to share at least some biological activities 
with tag7 proteins, and specifically cytokine modulatory proteins. Such activities are 
known in the art, some of which are described elsewhere herein. Specifically, 
polynucleotides and polypeptides of the invention are also useful for modulating the 
differentiation of normal and malignant cells, modulating the proliferation and/or 
differentiation of cancer and neoplastic cells, and modulating the immune response. 
Polynucleotides and polypeptides of the invention may represent a diagnostic marker for 
hematopoietic and immune diseases and/or disorders. The full-length protein should be a 
secreted protein, based upon homology to the tag7 protein. Therefore, it is secreted into 
serum, urine, or feces and thus the levels is assayable from patient samples. Assuming 
specific expression levels are reflective of the presence of immune disorders, this protein 
would provide a convenient diagnostic for early detection. In addition, expression of this 
gene product may also be linked to the progression of immune diseases, and therefore 
may itself actually represent a therapeutic or therapeutic target for the treatment of 
cancer. 

Polynucleotides and polypeptides of the invention may play an important role in 
the pathogenesis of human cancers and cellular transformation, particularly those of the 
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention 
may also be involved in the pathogenesis of developmental abnormalities based upon its 
potential effects on proliferation and differentiation of cells and tissue cell types. Due to 
the potential proliferating and differentiating activity of said polynucleotides and 
polypeptides, the invention is useful as a therapeutic agent in inducing tissue 
regeneration, for treating inflammatory conditions (e.g., inflammatory bowel syndrome, 
diverticulitis, etc.). 
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Moreover, the invention is useful in modulating the immune response to aberrant 
polypeptides, as may exist in rapidly proliferating cells and tissue cell types, particularly 
in adenocarcinoma cells, and other cancers. The translation product of this gene shares 
sequence homology with Tag7, which is a mouse cytokine that, in soluble form, triggers 
5 apoptosis in mouse L929 cells in vitro. 

The translation product of this gene also shares sequence homology with 
antimicrobial BGP-A, a bovine antimicrobial peptide from bovine neutrophils. Preferred 
polypeptides of this invention comprise residues 184 to 196 shown in SEQ ID NO: 36. 
This polypeptide is believed to be the active mature form of the translation product of 
10 this gene. 

This gene is expressed primarily in bone marrow and to a lesser extent in human 
chondrosarcoma and neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, infections, cancer, and disorders of the immune system. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of infected tissues and the 

20 immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. immune, hematopoietic, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 

25 level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 36 as residues: Ala-63 to Asn-68, Ala-71 to Gln-81, Tyr-135 to 
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Thr-141, Leu-167 to Gln-174, Pro-191 to Pro-196. Polynucleotides encoding said 
polypeptides are also provided. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 9 



This invention relates to newly identified polynucleotides, polypeptides encoded 
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. The polypeptide of the present 
invention has been putatively identified as a human butyrophilin homolog derived from a 
human testes tumor cDNA library. The polypeptide of the present invention is sometimes 
hereafter referred to as "Butyrophlin and B7-like IgG superfamily receptor", and/or 
"BBIR II". The invention also relates to inhibiting the action of such polypeptides. 

Butyrophilin is a glycoprotein of the immunoglobulin superfamily that is secreted 
in association with the milk-fat-globule membrane from mammary epithelial cells. The 
butyrophilin gene appears to have evolved from a subset of genes in the immunoglobulin 
superfamily and genes encoding the B30.2 domain, which is conserved in a family of 
zinc-finger proteins. Furthermore, expression analysis of butyrophilin genes has shown 
that butyrophilin expression increases during lactation in conjunction with an increase in 
milk fat content. These results suggest that the stage-specific expression of milk fat 
globule membrane glycoproteins in mammary epithelial cells is regulated in a similar but 
not necessarily identical mechanism to that of a major milk protein, beta-casein. 

The polypeptide of the present invention has been putatively identified as a 
member of the milk fat globule membrane glycoprotein family, and more particularly the 
butyrophilin family, and has been termed Butyrophlin and B7-like IgG superfamily 
receptor ("BBIR 11"). This identification has been made as a result of amino acid 
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sequence homology to the bovine butyrophilin precursor (See Genbank Accession No. 
gi|162773). 

Preferred polypeptides of the invention comprise the following nucleic acid 
sequence: 

5 ACATCCATGGCTCTAATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATC 
AGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGG 
GAGGACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCA 
TGGAAGTGCGGTTCTTCAGGGGCCAGTTCTCTAGCGTGGTCCACCTCTACAG 
GGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATCAAGGCAGGAC 

1 0 A A A ACTGGTG A AGG ATTCTATTGCGG AGGGGCGC ATCTCTCTG AGGCTGGA A 
AACATTACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTC 
TTACTACCAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTT 
CCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTACTCTGTCA 
GTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGA 

1 5 CAGG ATTTGTCC ACAG ACTCC AGG ACAA AC AG AG ACATGC ATGGCCTGTTTG 
ATGTGGAGATCTCTCTGACCGTCCAAGAGAACGCCGGGAGCATATCCTGTTC 
CATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAGG 
AGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGA 
ATACTCTGCTGTGGCCTATTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCC 

20 AAATTCCAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGA 
CAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGAT 
CCAGAGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCC 
ATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGA 
AGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGT 

25 GGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGT 
GGACAGGAGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTC 
CTCAGACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAG 
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CGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAG 

TGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCT 

GACATGTCGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATA 

ATGAGCAAAATGGAACTCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAG 

GCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCA 

CATTCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGC 

AGCCGGCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCCACATGGG 

AGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTAC 

ATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTC 

AGGTGAAGAACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAG 

TAGACAAGGAATGTGAATAATGCTTAGATCTTATTGATGACAGAGTGTATCC 

TAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAA (SEQ ID NO: 11 7), and/or 

ATGGCTCTAATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCA 

GTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGAC 

GCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAG 

TGCGG1TCTTCAGGGGCCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGG 

GAAGGACCAGCCATTTATGCAGATGCCACAGTATCAAGGCAGGACAAAACT 

GGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATT 

ACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTA 

CCAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTTCCTCTC 

ATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTACTCTGTCAGTCCTC 

GGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGA 

TTTGTCCACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTG 

GAGATCTCTCTGACCGTCCAAGAGAACGCCGGGAGCATATCCTGTTCCATGC 

GGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAGGAGATA 

CCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

152 

TGCTGTGGCCTATTTTTTGGCATTGTTG^ 

CAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCA 

GAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAG 

ACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAA 

AAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGT 

GGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGTGGACGGA 

GGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGG 

AGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGAC 

TGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAGCGTCTTC 

CCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTGTGGGA 

CCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGT 

CGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGC 

AAAATGGAACTCCCAGAGACAAGCAACAGTGA (SEQ ID NO: 118). Polypeptide 

encoded by these polynucleotides are also provided. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEV 

RFFRGQFSSVVHLYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVL 

DAGLYGCRISSQSYYQKAIWELQVSALGSVPLISITGYVDRDIQLLCQSSGWFPRP 

TAKWKGPQGQDLSTDSRTNRDMHGLFDVEISLTVQENAGSISCSMRHAHLSREV 

ESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSKFQWKIQAELDWRR 

KHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRFT 

RKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGY 

WVLRLNGEHLYFTLNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTC 

RFEGLLRPYIEYPSYNEQNGTPRDKQQ (SEQ ID NO: 119). Polynucleotides encoding 

these polypeptides are also provided. 
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A preferred polynucleotide splice variant of the invention comprises the 
following nucleic acid sequence: 

ACCTITTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACT 
CTGCTGTGGCCTATTTTTTGGCATTGTTGGACT 

CCAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGC 

AGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGA 

GACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGA 

AAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAAGAGT 

GTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGTGGACG 

GAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACA 

GGAGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAG 

ACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAGCGTCT 

TCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTGTGG 

GACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACAT 

GTCGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAG 

CAAAATGGAACTCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAGGCAAC 

CACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACATTC 

TTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCG 

GCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCCACATGGGAGTCA 

GGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTACATTTAG 

TTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGA 

AGAACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACA 

AGGAATGTGAATAATGCTTAGATCTTATTGATGACAGAGTGTATCCTAATGG 

TTTGTTCATTATATTACACTTTCAGTAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAA AAA (SEQ ID NO: 120). Polypeptides encoded by these polynucleotides are also 

provided. 
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Figures 22A-D show the nucleotide (SEQ ID NO: 19) and deduced amino acid 
sequence (SEQ ID NO:37) of BBIR II. Predicted amino acids from about 1 to about 17 
constitute the predicted signal peptide (amino acid residues from about 1 to about 17 in 
SEQ ID NO:37) and are represented by the underlined amino acid regions. 
5 Figure 23 shows the regions of similarity between the amino acid sequences of 

the Butyrophlin and B7-like lgG superfamily receptor (BBIR II) protein (SEQ ID 
NO:37) and the bovine butyrophilin precursor (SEQ ID NO: 121) 

Figure 24 shows an analysis of the integrin alpha 1 1 subunit (BBIR II) amino 
acid sequence. 

10 Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 

amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human small intestine, colon tumor, and human testes tumor cells and tissues. The 
polynucleotide of this invention was discovered in a human testes tumor cDNA library. 

15 Its translation product has homology to the B30.2-like domain which is 

characteristic of proteins containing zinc-binding B-box motifs, and particularly for 
butyrophilin family members. The polynucleotide contains an open reading frame 
encoding the BBIR II polypeptide of 3 18 amino acids. BBIR II exhibits a high degree of 
homology at the amino acid level to the bovine butyrophilin precursor (as shown in 

20 Figure 23). The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the BBIR II polypeptide having the amino acid sequence shown 
in Figures 22A-D (SEQ ED NO:37). The nucleotide sequence shown in Figures 22A-D 
(SEQ ID NO: 19) was obtained by sequencing a cloned cDNA (HTTDB46), which was 
deposited on November 17 at the American Type Culture Collection, and given 

25 Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
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nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 19 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
5 course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:19. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 

10 shown in SEQ ID NO: 19. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of BBIR II polynucleotide fragments of the invention 
include, for example, fragments that comprise, or alternatively, consist of, a sequence 
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to 

15 about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to 
about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to 
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to 
about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to 
about 750, from about 751 to about 800, from about 801 to about 850, from about 851 to 

20 about 900, from about 901 to about 950, from about 951 to about 1000, from about 1001 
to about 1050, from about 1051 to about 1 100, from about 1 101 to about 1 150, from 
about 1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 
1300, from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 
to about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from 

25 about 1551 to about 1600, from about 1601 to about 1650, from about 1651 to about 
1700, from about 1701 to about 1750, from about 1751 to about 1800, from about 1801 
to about 1850, from about 1851 to about 1900, from about 1901 to about 1950, from 
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about 1951 to about 2000, from about 2001 to about 2050, from about 2051 to about 
2100, from about 2101 to about 2150, from about 2151 to about 2200, from about 2201 
to about 2250, from about 2251 to about 2300, from about 2301 to about 2350, from 
about 2351 to about 2400, from about 2401 to about 2450, from about 2451 to about 
5 2500, from about 2501 to about 2550, from about 2551 to about 2600, from about 2601 
to about 2650, from about 2651 to about 2700, from about 2701 to about 2750, from 
about 2751 to about 2800, from about 2801 to about 2850, from about 2851 to about 
2900, from about 2901 to about 2950, from about 295 1 to about 3000, from about 3001 
to about 3050, from about 3051 to about 3059 of SEQ ID NO: 19, or the complementary 

10 strand thereto, or the cDNA contained in the deposited gene. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1 ) 
nucleotides, at either terminus or at both termini. 

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 

15 alternatively, consisting of, the mature BBIR II protein (amino acid residues from about 
18 to about 318 in Figures 22A-D (amino acids from about 18 to about 318 in SEQ ID 
NO:37). Since the location of this form of the protein has been predicted by computer 
analysis, one of ordinary skill would appreciate that the amino acid residues constituting 
these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending 

20 on the criteria used to define this location. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of BBIR II. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 

25 regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 

hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of BBIR II. The data 
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representing the structural or functional attributes of BBIR II set forth in Figure 24 
and/or Table VIII, as described above, was generated using the various modules and 
algorithms of the DNA*STAR set on default parameters. In a preferred embodiment, the 
data presented in columns VIII, IX, XIII, and XIV of Table VIII can be used to 
determine regions of BBIR II which exhibit a high degree of potential for antigenicity. 
Regions of high antigenicity are determined from the data presented in columns VIII, IX, 
XIII, and/or XIV by choosing values which represent regions of the polypeptide which 
are likely to be exposed on the surface of the polypeptide in an environment in which 
antigen recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 24, but may, as 
shown in Table VIII, be represented or identified by using tabular representations of the 
data presented in Figure 24. The DNA*STAR computer algorithm used to generate 
Figure 24 (set on the original default parameters) was used to present the data in Figure 
24 in a tabular format (See Table VIII). The tabular format of the data in Figure 24 is 
used to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 24 and in Table VIII include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 22A-D. As set out in Figure 24 and in Table VIII, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened BBIR II 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
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forms of the polypeptides generally will be retained when less than the majority of the 
residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that an BBIR II mutein 
with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six BBIR U 
amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the BBIR II amino acid sequence 
shown in Figures 22A-D, up to the cystein residue at position number 313 and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues n 1-3 18 of Figures 22A-D, 
where nl is an integer from 2 to 313 corresponding to the position of the amino acid 
residue in Figures 22A-D (which is identical to the sequence shown as SEQ ID NO:37). 
In another embodiment, N-terminal deletions of the BBIR II polypeptide can be 
described by the general formula n2-318, where n2 is a number from 2 to 313, 
corresponding to the position of amino acid identified in Figures 22A-D. N-terminal 
deletions of the BBIR II polypeptide of the invention shown as SEQ ID NO:37 include 
polypeptides comprising the amino acid sequence of residues: N-terminal deletions of the 
BBIR II polypeptide of the invention shown as SEQ ID NO:37 include polypeptides 
comprising the amino acid sequence of residues: A-2 to T-318; L-3 to T-318; M-4 to T- 
318; L-5 to T-318; S-6to T-318; L-7 to T-318; V-8 to T-318; L-9 to T-318; S-10 to T- 
318; L-ll to T-318; L-12 to T-318; K-13 toT-318; L-14 to T-318; G-15 to T-318; S-16 
to T-318; G-17 to T-318; Q-18 to T-318; W-19 to T-318; Q-20 toT-318; V-21 to T-318; 
F-22 to T-318; G-23 to T-318; P-24 to T-318; D-25 to T-318; K-26 to T-318; P-27 toT- 
318; V-28 to T-318; Q-29 to T-318; A-30 to T-318; L-3 1 to T-318; V-32 to T-318; G-33 
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to T-318; E-34 toT-318; D-35 toT-318; A-36 toT-318; A-37 to T-318; F-38 to T-318; 
S-39 to T-318; C-40 to T-318; F-41 toT-318; L-42 to T-318; S^3 to T-318; P^4 to T- 
318; K-45 to T-318; T-46 to T-318; N-47 to T-318; A-48 toT-318; E-49 to T-318; A-50 
to T-318; M-51 to T-318; E-52 to T-318; V-53 to T-318; R-54 to T-318; F-55 toT-318; 
F-56 to T-318; R-57 to T-318; G-58 to T-318; Q-59 to T-318; F-60 to T-318; S-61 to T- 
318; S-62 toT-318: V-63 to T-318; V-64 to T-318; H-65 to T-318; L-66 to T-318; Y-67 
• to T-318; R-68 to T-318; D-69 toT-318; G-70 to T-318; K-71 to T-318; D-72 to T-318; 
Q-73 to T-318; P-74 to T-318; F-75 to T-318; M-76 toT-318; Q-77 to T-318; M-78 to T- 
318: P-79 to T-318; Q-80 to T-318; Y-81 to T-318; Q-82 to T-318; G-83 toT-318; R-84 
to T-318; T-85 to T-318; K-86 to T-318; L-87 to T-318; V-88 to T-318: K-89 to T-318; 
D-90 toT-318; S-91 to T-318; 1-92 to T-318; A-93 to T-318; E-94 to T-318; G-95 to T- 
318; R-96 to T-318: 1-97 toT-318; S-98 to T-318; L-99 to T-318; R-100 to T-318; L-101 
to T-318; E-102 to T-318; N-103 toT-318;I-104 to T-318; T-105 to T-318; V-106 toT- 
318; L-107 to T-318; D-108 to T-318; A-109 to T-318; G-110toT-318: L-lll to T-318; 
Y-112 to T-318; G-113 to T-318; C-114 to T-318; R-l 15 to T-318; 1-116 toT-318;S-l 17 
to T-318; S-118 to T-318; Q-119 to T-318; S-120 to T-318; Y-121 to T-318; Y-122 to T- 
318; Q-123 toT-318; K-124 to T-318; A-125 to T-318; 1-126 to T-318; W-127 to T-318; 
= E-128 to T-318: L-129 to T-318;Q-130 to T-318; V-131 to T-318; S-132 to T-318; A- 
-133 to T-318; L-134 to T-318; G-135 to T-318; S-136 toT-318; V-137 to T-318; P-138 
to T-318; L-139 to T-318; 1-140 to T-318; S-141 to T-318; 1-142 to T-318;A-143 to T- 
318; G-144 to T-318; Y-145 to T-318; V-146 to T-318; D-147 to T-318; R-148 to T-318; 
D-149to T-318; 1-150 to T-318; Q-151 to T-318; L-152 to T-318; L-153 to T-318; C-154 
to T-318; Q-155 to T-318;S-156 to T-318: S-157 to T-318; G-158 to T-318; W-159 to T- 
318; F-160 to T-318; P-161 to T-318; R-162to T-318; P-163 to T-318; T-164 to T-318; 
A-165 to T-318; K-166 to T-318; W-167 to T-318; K-168 toT-318; G-169 to T-318; P- 
170 to T-318; Q-171 to T-318; G-172 to T-318; Q-173 to T-318; D-174 to T-318;L-175 
to T-318; S-176 to T-318; T-177 to T-318; D-178 to T-318; S-179 to T-318; R-180 to T- 
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318; T-181 toT-318; N-182 toT-318; R-183 to T-318; D-184 to T-318; M-185 to T-318; 
H-186 to T-318; G-187 to T-318;L-188 to T-318; F-189 to T-318; D-190 to T-318; V- 
191 to T-318; E-192 to T-318; 1-193 to T-318; S-194 toT-318; L-195 to T-318; T-196 to 
T-318; V-197 to T-318; Q-198 to T-318; E-199 to T-318; N-200 to T-318;A-20I to T- 
318; G-202 to T-3 1 8; S-203 to T-318; 1-204 to T-318; S-205 to T-318; C-206 to T-3 1 8; 
S-207 toT-318; M-208 to T-318; R-209 to T-318; H-210to T-318; A-21I to T-318; H- 
2 12 to T-3 1 8; L-2 1 3 to T-3 1 8;S-214 to T-318; R-2 1 5 to T-3 1 8; E-2 1 6 to T-3 18; V-217 
to T-318; E-218 to T-318; S-219 to T-318: R-220 toT-318; V-221 to T-318; Q-222 to T- 
318; 1-223 to T-3 18; G-224 to T-318; D-225 to T-318; W-226 toT-318;R-227 to T-318; 
R-228 to T-318; K-229 to T-318; H-230 to T-318; G-231 to T-318; Q-232 to T-318; A- 
233to T-318; G-234 to T-318; K-235 to T-318; R-236 to T-318; K-237 to T-318; Y-238 
to T-318; S-239 toT-318; S-240 to T-318; S-241 to T-318; H-242 to T-318; 1-243 to T- 
318; Y-244 to T-318; D-245 to T-318;S-246 to T-318; F-247 to T-318; P-248 to T-318; 
S-249 to T-318; L-250 to T-318; S-251 to T-318; F-252 toT-318; M-253 to T-318: D- 
254 to T-318; F-255 to T-318; Y-256 to T-318; 1-257 to T-318; L-258 to T-3 18;R-259 to 
T-318; P-260 to T-318; V-261 to T-318; G-262 to T-318; P-263 to T-318; C-264 to T- 
318; R-265to T-318; A-266 to T-318; K-267 to T-318; L-268 to T-318; V-269 to T-318; 
M-270 to T-318; G-271 toT-318; T-272 to T-318; L-273 to T-318; K-274 to T-318; L- 
275 to T-318; Q-276 to T-318; 1-277 to T-318;L-278 to T-318; G-279 to T-318; E-280 to 
T-318; V-281 to T-318; H-282 to T-318; F-283 to T-318; V-284to T-318; E-285 to T- 
318; K-286 to T-318; P-287 to T-318; H-288 to T-318; S-289 to T-318; L-290 toT-318; 
L-291 to T-318; Q-292 to T-318; 1-293 to T-318; S-294 to T-318; G-295 to T-318: G- 
296 to T-318;S-297 to T-318; T-298 to T-318; T-299 to T-318; L-300 to T-318; K-301 to 
T-318; K-302 to T-318; G-303to T-318; P-304 to T-318; N-305 to T-318; P-306 to T- 
318; W-307 to T-318; S-308 to T-318; F-309 toT-318; P-310 to T-318; S-31 1 to T-318; 
P-312 to T-318; C-313 to T-318; of SEQ ID NO:37. Polypeptides encoded by these 
polynucleotides are also encompassed by the invention. 
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Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability 
to illicit mitogenic activity, induce differentiation of normal or malignant cells, ability to 
5 multimerize, etc.) may still be retained. For example the ability of the shortened BBIR II 
mutein to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 

10 immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that an BBIR II mutein with a large 
number of deleted C-terminal amino acid residues may retain some biological or 
immunogenic activities. In fact, peptides composed of as few as six BBIR II amino acid 
residues may often evoke an immune response. 

15 Accordingly, the present invention further provides polypeptides having one or 

more residues deleted from the carboxy terminus of the amino acid sequence of the 
BBIR II polypeptide shown in Figures 22A-D, up to the serine residue at position 
number 6, and polynucleotides encoding such polypeptides. In particular, the present 
invention provides polypeptides comprising the amino acid sequence of residues I -ml of 

20 Figure 1, where ml is an integer from 6 to 318 corresponding to the position of the 

amino acid residue in Figures 22A-D. Moreover, the invention provides polynucleotides 
encoding polypeptides comprising, or alternatively consisting of, the amino acid 
sequence of C-terminal deletions of the BBIR II polypeptide of the invention shown as 
SEQ ID NO:37 include polypeptides comprising the amino acid sequence of residues M- 

25 1 to P-317; M-l toF-316; M-l to L-3I5;M-1 to A-314; M-l toC-313; M-l to P-3I2; M- 
1 to S-31 i; M-l to P-310; M-l to F-309; M-l to S-308; M-lto W-307; M-l to P-306; M- 
1 to N-305; M-l to P-304; M-l to G-303; M-l to K-302; M-l to K-301; M-l toL-300; 
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M-l to T-299; M-l to T-298; M-l to S-297; M-l to G-296; M-l to G-295; M-l to S-294; 
M-l toI-293; M-l to Q-292; M-l to L-291; M-l to L-290; M-l to S-289; M-l to H-288; 
M-l to P-287; M-l toK-286; M-l to E-285; M-l to V-284; M-l to F-283; M-l to H-282; 
M-l to V-281; M-l to E-280; M-l toG-279; M-l to L-278; M-l to 1-277; M-l to Q-276; 
M-l to L-275; M-l to K-274; M-l to L-273; M-l toT-272; M-l to G-271; M-l to M-270; 
M-l to V-269; M-l to L-268; M-l to K-267; M-l to A-266; M-l toR-265; M-l to C-264; 
M-l to P-263; M-l to G-262; M-l to V-261; M-l to P-260; M-l to R-259; M-l toL-258; 
M-l to 1-257; M-l to Y-256; M-l to F-255; M-l to D-254; M-l to M-253; M-l to F-252; 
M-l toS-251; M-l to L-250; M-l to S-249; M-l to P-248; M-l to F-247; M-l to S-246; 
M-l to D-245; M-l toY-244; M-l to 1-243; M-l to H-242; M-l to S-241; M-l to S-240; 
M-l to S-239; M-l to Y-238; M-l toK-237; M-l to R-236; M-I to K-235; M-l to G-234; 
M-l to A-233; M-l to Q-232; M-l to G-231; M-l toH-230; M-l to K-229; M-l to R- 
228; M-l to R-227; M-l to W-226; M-l to D-225; M-l to G-224; M-l toI-223; M-l to 
Q-222; M-l to V-221; M-l to R-220; M-l to S-219; M-l to E-218; M-l to V-217; M-l 
toE-216; M-l toR-215;M-l toS-214; M-l toL-213;M-l toH-212; M-l to A-211;M-1 
to H-210; M-l toR-209; M-l to M-208; M-l to S-207; M-l to C-206; M-l to S-205; M-l 
to 1-204; M-l to S-203; M-l toG-202: M-l to A-201; M-l to N-200; M-l to E-199; M-l 
to Q-198; M-l to V-197; M-l toT-196; M-l toL-195; M-l to S-194; M l to 1-193; M-l 
to E-192; M-l to V-191; M-l to D-190; M-l to F-189; M-l toL-188; M-l to G-187; M-l 
toH-186; M-l toM-185;M-l toD-184;M-l toR-183; M-l toN-182; M-l toT-181; M- 
1 to R-180; M-l to S-179; M-l to D-178; M-l to T-177; M-l to S-176; M-l to L-175; 
M-l toD-174; M-l to Q-173; M-l to G-172; M-l to Q-171; M-l to P-170; M-l to G-169; 
M-l to K-168; M-l toW-167; M-l to K-166; M-l to A-165; M-l to T-164; M-l to P- 
163; M-l to R-162; M-l to P-161; M-l toF-160; M-l to W-159; M-l to G-158: M-l to 
S-157;M-1 toS-156; M-l toQ-155;M-l toC-154; M-l loL-153;M-l toL-152; M-l to 
Q-151; M-l to 1-150; M-l to D-149; M-l to R-148; M-l to D-147; M-1 toV-146; M l to 
Y-145; M-l to G-144; M-l to A-143; M-l to 1-142; M-l to S-141; M-l to 1-140; M-l 
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toL-139; M-l to P-138; M-l to V-137; M-l to S-136; M-l to G-135; M-l to L-134; M-l 
to A-133; M-l toS-132; M-l to V-131; M-l to Q-130; M-l to L-129; M-l to E-128; M-l 
to W-127; M-l to 1-126; M-l toA-125; M-l to K-124; M-l to Q-123; M-l to Y-122; M-l 
to Y-121; M-l to S-120; M-l to Q-l 19; M-l toS-118; M-l to S-l 17; M-l to 1-1 16; M-l 
5 to R-l 15; M-l to C-l 14; M-l to G-l 13; M-l to Y-l 12; M-l toL-1 1 1; M-l to G-l 10; M-l 
to A-109; M-l to D-108; M-l to L-107; M-l to V-106; M-l to T-105; M-l tol-104; M-l 
to N-103: M-l toE-102; M-l to L-101; M-l to R-100; M-l to L-99; M-l to S-98; M-l to 
I-97;M-1 to R-96; M-l to G-95; M-l to E-94; M-l to A-93; M-l to 1-92; M-l to S-91; 
M-l to D-90; M-l to K-89;M-1 to V-88; M-l to L-87; M-l to K-86; M-l to T-85; M-l to 

10 R-84; M-l to G-83; M-l to Q-82; M-l to Y-81;M-1 to Q-80; M-l to P-79; M-l to M-78; 
M-l to Q-77; M-l to M-76; M-l to F-75; M-l to P-74; M-l to Q-73;M-1 to D-72; M-l to 
K-71: M-l to G-70; M-l to D-69; M-l to R-68; M-l to Y-67; M-l to L-66; M-l to H- 
65;M-1 to V-64; M-l to V-63; M-l to S-62; M-l to S-61 ; M-l to F-60; M-l to Q-59; M- 
1 to G-58; M-l to R-57;M-1 to F-56; M-l to F-55; M-l to R-54; M-l to V-53; M-l to E- 

15 52; M-l to M-51; M-l to A-50; M-l to E-49;M-1 to A-48; M-l to N-47; M-l to TA6; 
M-l to K-45; M-l to P-44; M-l to S-43; M-l to L-42; M-l to F-41;M-1 to C-40; M-l to 
S-39; M-l to F-38; M-l to A-37; M-l to A-36; M-l to D-35; M-l to E-34; M-l to G- 
33;M-1 to V-32; M-l to L-31; M-l to A-30; M-l toQ-29; M-l to V-28; M-l to P-27; M- 
1 to K-26; M-l to D-25;M-1 to P-24; M-l to G-23; M-l to F-22; M-l to V-21; M-l to Q- 

20 20; M-l to W-19; M-l to Q-18; M-l to G-17;M-1 to S-16; M-l to G-15; M-l to L-14; 
M-l to K-13; M-l to L-12; M-l to L-ll; M-l to S-10; M-l to L-9;M-1 to V-8; M-l to L- 
7; M-l to S-6; of SEQ ID NO:37. Polypeptides encoded by these polynucleotides are 
also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 

25 sequences related to extensive portions of SEQ ID NO:19 which have been determined 
from the following related cDNA genes: HTTDB46R (SEQ ID NO: 122), and HSIEA44R. 
(SEQ ID NO: 123). 
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Based on the sequence similarity to the bovin butyrophilin precursor, translation 
product of this gene is expected to share at least some biological activities with B30.2- 
like domain containing proteins, and specifically butyrophilin proteins. Such activities 
are known in the art, some of which are described elsewhere herein. Specifically, 
5 polynucleotides and polypeptides of the invention are also useful for modulating the 
differentiation of normal and malignant cells, modulating the proliferation and/or 
differentiation of cancer and neoplastic cells, and regulation of cell growth and 
differentiation. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for breast diseases and/or disorders, in addition to disorders of 
10 secretory organs and tissues (which include, testicular and gastrointestinal disorders, 
particularly those cells which serve secretory functions for seminal fluid or 
gastrointestinal hormones, and disorders of the mucosal membranes of such cells and 
tissues, etc.). 

The full-length protein should be a secreted protein, based upon homology to the 
15 butyrophilin family of proteins. Therefore, it is secreted into milk, serum, urine, seminal 
fluid, or feces and thus the levels is assayable from patient samples. Assuming specific 
expression levels are reflective of the presence of breast disorders (i.e., breast cancer, 
breast dysfunction, etc.) this protein would provide a convenient diagnostic for early 
detection of such disorders 
20 In addition, expression of this gene product may also be linked to the progression 

of breast diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of breast cancer. Polynucleotides and polypeptides of the 
invention may play an important role in the pathogenesis of human cancers and cellular 
transformation, particularly those of secretory cells and tissues. Polynucleotides and 
25 polypeptides of the invention may also be involved in the pathogenesis of developmental 
abnormalities based upon its potential effects on proliferation and differentiation of cells 
and tissue cell types. 
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Due to the potential proliferating and differentiating activity of said 
polynucleotides and polypeptides, the invention is useful as a therapeutic agent in 
inducing tissue regeneration, for treating inflammatory conditions. Moreover, the 
invention is useful in modulating the immune response to aberrant polypeptides, as may 
5 exist in rapidly proliferating cells and tissue cell types, particularly in cancers. The 
invention, including agonists and/or antagonists thereof, is useful in modulating the 
nutritional value of milk, its caloric content, its fat content, and may conceivably be 
useful in mediating the adaption of breast secretory function as a delivery vehicle for 
therapeutics (i.e., transgenic breast secretory tissue for transferring therapeutically active 

10 proteins to infants). 

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 

15 described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 

20 death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Alternatively, this gene product is involved in the pattern of cellular proliferation 
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in 
tissues - particularly adult tissues - may correlate with patterns of abnormal cellular 

25 proliferation, such as found in various cancers. Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control cell 
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and tissue type specification. Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein may 
modulate apoptosis or tissue differentiation and is useful in the detection, treatment, 
5 and/or prevention of degenerative or proliferative conditions and diseases. The protein is 
useful in modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as tissue markers, 
10 to isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 

This gene is expressed primarily in small intestine, colon tumor, and to a lesser 

15 extent in human testes tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, gastrointestinal diseases and/or disorders, in addition to lactation disorders, 

20 and tumors of the testes. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and reproductive systems, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 

25 (e.g. immune, testicular, gastrointestinal, and cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the standard 
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gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 37 as residues: Tyr-67 to Pro-74, Ser-1 17 to Gln-123, Pro-161 to 
Met-185, Gly-224 to His-242, Thr-299 to Trp-307. Polynucleotides encoding said 
polypeptides are also provided. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

10 

The translation product of this gene contains a serine protease motif and 
accordingly is believed to possess serine protease activity. Assays for determining such 
activity are well known in the art. Preferred polypeptides of this invention possess such 
activity. 

15 Included in this invention as preferred domains are serine protease histidine 

active site domains, which were identified using the ProSite analysis tool (Swiss Institute 
of Bioinformatics). The catalytic activity of the serine proteases from the trypsin family 
is provided by a charge relay system involving an aspartic acid residue hydrogen- bonded 
to a histidine, which itself is hydrogen-bonded to a serine. The sequences in the vicinity 

20 of the active site serine and histidine residues are well conserved in this family of 
proteases [1]. Consensus pattern: [LIVM]-[ST]-A-[STAG]-H-C , H is the active site 
residue. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GTLVAEKHVLTAAHCIHDGKTYVKGTQ (SEQ ID NO: 124). 
25 Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the serine protease histidine active 
site domain of the sequence referenced in Table XIII for this gene, and at least 5, 10, 15, 
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20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 
sequence. The additional contiguous amino acid residues is N-terminal or C- terminal to 
the serine protease histidine active site domain. 

Alternatively, the additional contiguous amino acid residues is both N-terminal 
and C-terminal to the serine protease histidine active site domain, wherein the total N- 
and C-terminal contiguous amino acid residues equal the specified number. The above 
preferred polypeptide domain is characteristic of a signature specific to serine protease 
proteins. Based on the sequence similarity, the translation product of this gene is 
expected to share at least some biological activities with serine proteases. Such activities 
are known in the art, some of which are described elsewhere herein. 

In another embodiment, polypeptides comprising the amino acid sequence of the 
open reading frame upstream of the predicted signal peptide are contemplated by the 
present invention. Specifically, polypeptides of the invention comprise the following 
amino acid sequence: 

GTRGQAWEPRALSRRPHLSERRSEPRPGRAARRGTVLGMAGIPGLLFLLFF 

LLCAVGQVSPYSAPWKPTWPAYRLPVVLPQSTLNLAKPDFGAEAKLEVSSSCGP 

QCHKGTPLPTYEEAKQYLSYETLYANGSRTETQVGIYILSSSGDGAQHRDSGSSG 

KSRRKRQIYGYDSRFSIFGKDFLLNYPFSTSVKLSTGCTGTLVAEKHVLTAAHC1 

HDGKTYVKGTQKLRVGFLKPKFKDGGRGANDSTSAMPEQMKFQWIRVKRTHV 

PKGWIKGNANDIGMDYDYALLELKKPHKRKFMKIGVSPPAKQLPGGRIHFSGYD 

NDRPGNLVYRFCDVKDETYDLLYQQCDSQPGASGSGVYVRMWKRQHQKWER 

KIIGMISGHQWVDMDGSPQEFTRGCSEITPLQYIPDISIGV (SEQ ID NO: 125). 

Polynucleotides encoding these polypeptides are also provided. 

A preferred polypeptide variant of the invention comprises the following amino 
acid sequence: 

MAGIPGLLFLLFFLLCAVGQVSPYSAPWKPTWPAYRLPVVLPQSTLNLAKPD 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

169 

FGAEAKLEVSSSCGPQCHKGTPLPTYEEAKQYLSYETLYANGSRTETQVGIYILS 

SSGDGAQHRDSGSSGKSRRKRQIYGYDSRFSIFGKDFLLNYPFSTSVKLSTGCTG 
TLVAEKHVLT 

AAHCIHDGKTYVKGTQKLRVGFLKPKFKDGGRGANDSTSAMPEQMKFQWIRV 
KRTHVPKGWIKGNANDIGMDYDYALLELKKPHKRKFMKIGVSPPAKQLPGGRI 
HFSGYDNDRPGNLVYRFCDVKDETYDLLYQQCDAQPGASGSGVYVRMWKRQ 

QQKWERKIIGIFSGHQWVDMNGSPQDFNVAVRITPLKYAQICYWIKGNYLDCRE 
G (SEQ ID NO: 126). Polynucleotides encoding these polypeptides are also provided. 

Figures 25 A-B show the nucleotide (SEQ ID NO:20) and deduced amino acid 
sequence (SEQ ID NO:38) of the present invention. Predicted amino acids from about 1 
to about 19 constitute the predicted signal peptide (amino acid residues from about I to 
about 19 in SEQ ID NO:38) and are represented by the underlined amino acid regions; 
amino acids from about 162 to about 188 constitutes the predicted serine protease 
histidine active site domain (amino acids residues from about 162 to about 188 in SEQ 
ID NO:38) and are represented by the double underlined amino acid regions; and amino 
acid residue 175 (amino acid residue 175 in SEQ ID NO:38) constitutes the predicted 
histidine active site residue and is represented by the bold amino acid. 

Figure 26 shows the regions of similarity between the amino acid sequences of 

the present invention SEQ ID NO:38, and the Human Pancreatic Elastase 2 protein 

(gi|219620)(SEQ ID NO: 127). 

Figure 27 shows an analysis of the amino acid sequence of SEQ ID NO:38. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 

regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that this gene is expressed highest in HUVEC, 

HUVEC +LPS, smooth muscle, fibroblasts, present in heart, brain, placenta, lung, liver, 

muscle, kidney, pancreas, spleen, thymus, prostate, testes, ovary, small intestine, colon 

and weakly in PBLs. 
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The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 25 A-B (SEQ ID NO:38), which was determined by sequencing a cloned cDNA. 
The nucleotide sequence shown in Figures 25 A-B (SEQ ID NO:20) was obtained by 
5 sequencing a cloned cDNA (HUS AQ05), which was deposited on Nov. 1 7, 1 998 at the 
American Type Culture Collection, and given Accession Number 203484. The deposited 
gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the 
Sall/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 

10 molecules described herein. By a fragment of an isolated DNA molecule having the 

nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:20 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 

1 5 course, larger fragments 50- 1 500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:20. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 

20 shown in SEQ ID NO:20. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 

25 about 1 5 1 to about 200, from about 201 to about 250, from about 25 1 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 55 1 to about 
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600, from about 601 to about 650, from about 65 1 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 850, from about 851 to about 
900, from about 901 to about 950, from about 951 to about 1000, from about 1001 to 
about 1050, from about 1051 to about 1 100, from about 1 101 to about 1 150, from about 
5 1 151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, 
from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 to 
about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from about 
1551 to about 1600, from about 1601 to about 1650, from about 1651 to about 1699 of 
SEQ ID NO:20, or the complementary strand thereto, or the cDNA contained in the 

10 deposited gene. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In 
additional embodiments, the polynucleotides of the invention encode functional 
attributes of the corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 

15 comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 

20 the structural or functional attributes of the protein set forth in Figure 27 and/or Table IX, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table IX can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 

25 antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
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be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 27, but may, as 
shown in Table IX, be represented or identified by using tabular representations of the 
data presented in Figure 27. The DNA*STAR computer algorithm used to generate 
Figure 27 (set on the original default parameters) was used to present the data in Figure 
27 in a tabular format (See Table IX). The tabular format of the data in Figure 27 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 27 and in Table IX include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figure 1. As set out in Figure 27 and in Table IX, such preferred regions include 
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou- 
Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions 
and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, 
Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index and 
Emini surface-forming regions. Even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
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activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
5 Figures 25A-B, up to the aspartic acid residue at position number 370 and 

polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues n 1-375 of Figures 25A-B, 
where nl is an integer from 2 to 370 corresponding to the position of the amino acid 
residue in Figures 25A-B (which is identical to the sequence shown as SEQ ID NO:38). 

10 N-terminal deletions of the polypeptide of the invention shown as SEQ ID NO:38 

include polypeptides comprising the amino acid sequence of residues: A-2 to V-375; G-3 
to V-375; 1-4 to V-375; P-5 to V-375; G-6 to V-375; L-7 to V-375; L-8 to V-375; F-9 to 
V-375; L-10 to V-375; L-l 1 to V-375; F-12 to V-375; F-13 to V-375; L-14 to V-375; L- 
15 to V-375; C-16 to V-375; A-17 to V-375; V-18 to V-375; G-19 to V-375; Q-20 to V- 

15 375; V-21 to V-375; S-22 to V-375; P-23 to V-375; Y-24 to V-375; S-25 to V-375; A-26 
to V-375; P-27 to V-375; W-28 to V-375; K-29 to V-375; P-30 to V-375; T-31 to V-375; 
W-32 to V-375; P-33 to V-375; A-34 to V-375; Y-35 to V-375; R-36 to V-375; L-37 to 
V-375; P-38 to V-375; V-39 to V-375; V-40 to V-375; L-41 to V-375; P-42 to V-375; Q- 
43 to V-375; S-44 to V-375; T-45 to V-375; L-46 to V-375; N-47 to V-375; L-48 to V- 

20 375; A-49 to V-375; K-50 to V-375; P-5 1 to V-375; D-52 to V-375; F-53 to V-375; G-54 
to V-375; A-55 to V-375; E-56 to V-375; A-57 to V-375; K-58 to V-375; L-59 to V-375; 
E-60 to V-375; V-61 to V-375; S-62 to V-375; S-63 to V-375; S-64 to V-375; C-65 to V- 
375; G-66 to V-375; P-67 to V-375; Q-68 to V-375; C-69 to V-375; H-70 to V-375; K- 
71 to V-375; G-72 to V-375; T-73 to V-375; P-74 to V-375; L-75 to V-375; P-76 to V- 

25 375; T-77 to V-375; Y-78 to V-375; E-79 to V-375; E-80 to V-375; A-81 to V-375; K-82 
to V-375; Q-83 to V-375; Y-84 to V-375; L-85 to V-375; S-86 to V-375; Y-87 to V-375; 
E-88 to V-375; T-89 to V-375; L-90 to V-375; Y-91 to V-375; A-92 to V-375; N-93 to 
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V-375; G-94 to V-375; S-95 to V-375; R-96 to V-375; T-97 to V-375; E-98 to V-375; T- 
99 to V-375; Q-100 to V-375; V-101 to V-375; G-102 to V-375; 1-103 to V-375; Y-104 
to V-375; 1-105 to V-375; L-106 to V-375; S-107 to V-375; S-108 to V-375; S-109 to V- 
375; G-l 10 to V-375; D-l 11 to V-375; G-l 12 to V-375; A-l 13 to V-375; Q-114 to V- 
5 375; H-115 to V-375; R-116 to V-375; D-l 17 to V-375; S-118 to V-375; G-119 to V- 
375; S-120 to V-375; S-121 to V-375; G-122 to V-375; K-123 to V-375; S-124 to V- 
375; R-125 to V-375; R-126 to V-375; K-127 to V-375; R-128 to V-375; Q-129 to V- 
375; 1-130 to V-375; Y-131 to V-375; G-132 to V-375; Y-133 to V-375; D-134 to V- 
375; S-135 to V-375; R-136 to V-375; F-137 to V-375; S-138 to V-375; 1-139 to V-375; 

10 F-140 to V-375; G-141 to V-375; K-142 to V-375; D-I43 to V-375; F-144 to V-375; L- 
145 to V-375; L-146 to V-375; N-147 to V-375; Y-148 to V-375; P-149 to V-375; F-150 
to V-375; S-151 to V-375; T-152 to V-375; S-153 to V-375; V-154 to V-375; K-155 to 
V-375; L-156 to V-375; S-157 to V-375; T-158 to V-375; G-159 to V-375; C-160 to V- 
375; T-161 to V-375; G-162 to V-375; T-163 to V-375; L-164 to V-375; V-165 to V- 

15 375; A-166 to V-375; E-167 to V-375; K-168 to V-375; H-169 to V-375; V-170 to V- 
375; L-171 to V-375; T-172 to V-375; A-173 to V-375; A-174 to V-375; H-175 to V- - 
375; C-176 to V-375; 1-177 to V-375; H-178 to V-375; D-179 to V-375; G-180 to V- 
375; K-181 to V-375; T-182 to V-375; Y-183 to V-375; V-184 to V-375; K-185 to V- 
375; G-186 to V-375; T-187 to V-375; Q-188 to V-375; K-189 to V-375; L-190 to V- 

20 375; R-191 to V-375; V-192 to V-375; G-193 to V-375; F-194 to V-375; L-195 to V- 
375; K-196 to V-375; P-197 to V-375; K-198 to V-375; F-199 to V-375; K-200 to V- 
375; D-201 to V-375; G-202 to V-375; G-203 to V-375; R-204 to V-375; G-205 to V- 
375; A-206 to V-375; N-207 to V-375; D-208 to V-375; S-209 to V-375; T-210 to V- 
375; S-21 1 to V-375; A-212 to V-375; M-213 to V-375; P-214 to V-375; E-215 to V- 

25 375; Q-216 to V-375; M-217 to V-375; K-218 to V-375; F-219 to V-375; Q-220 to V- 
375; W-221 to V-375; 1-222 to V-375; R-223 to V-375; V-224 to V-375; K-225 to V- 
375; R-226 to V-375; T-227 to V-375; H-228 to V-375; V-229 to V-375; P-230 to V- 
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375; K-231 to V-375; G-232 to V-375; W-233 to V-375; 1-234 to V-375; K-235 to V- 
375; G-236 to V-375; N-237 to V-375; A-238 to V-375; N-239 to V-375; D-240 to V- 
375; 1-241 to V-375; G-242 to V-375; M-243 to V-375; D-244 to V-375; Y-245 to V- 
375; D-246 to V-375; Y-247 to V-375; A-248 to V-375; L-249 to V-375; L-250 to V- 
5 375; E-251 to V-375; L-252 to V-375; K-253 to V-375; K-254 to V-375; P-255 to V- 
375; H-256 to V-375; K-257 to V-375; R-258 to V-375; K-259 to V-375; F-260 to V- 
375; M-261 to V-375; K-262 to V-375; 1-263 to V-375; G-264 to V-375; V-265 to V- 
375; S-266 to V-375; P-267 to V-375; P-268 to V-375; A-269 to V-375; K-270 to V- 
375; Q-271 to V-375; L-272 to V-375; P-273 to V-375; G-274 to V-375; G-275 to V- 

10 375; R-276 to V-375; 1-277 to V-375; H-278 to V-375; F-279 to V-375; S-280 to V-375; 
G-281 to V-375; Y-282 to V-375; D-283 to V-375; N-284 to V-375; D-285 to V-375; R- 
286 to V-375; P-287 to V-375; G-288 to V-375; N-289 to V-375: L-290 to V-375; V-291 
to V-375; Y-292 to V-375; R-293 to V-375; F-294 to V-375; C-295 to V-375; D-296 to 
V-375; V-297 to V-375; K-298 to V-375; D-299 to V-375; E-300 to V-375; T-301 to V- 

15 375; Y-302 to V-375; D-303 to V-375; L-304 to V-375; L-305 to V-375; Y-306 to V- 
375; Q-307 to V-375; Q-308 to V-375; C-309 to V-375; D-310 to V-375; S-311 to V- 
375; Q-312 to V-375; P-313 to V-375; G-314 to V-375; A-315 to V-375; S-316 to V- 
375; G-317 to V-375; S-318 to V-375; G-319 to V-375; V-320 to V-375; Y-321 to V- 
375; V-322 to V-375; R-323 to V-375; M-324 to V-375; W-325 to V-375; K-326 to V- 

20 375; R-327 to V-375; Q-328 to V-375; H-329 to V-375; Q-330 to V-375; K-331 to V- 
375; W-332 to V-375; E-333 to V-375; R-334 to V-375; K-335 to V-375; 1-336 to V- 
375; 1-337 to V-375; G-338 to V-375; M-339 to V-375; 1-340 to V-375; S-341 to V-375; 
G-342 to V-375; H-343 to V-375; Q-344 to V-375; W-345 to V-375; V-346 to V-375; 
D-347 to V-375; M-348 to V-375; D-349 to V-375; G-350 to V-375; S-351 to V-375; P- 

25 352 to V-375; Q-353 to V-375; E-354 to V-375; F-355 to V-375; T-356 to V-375; R-357 
to V-375; G-358 to V-375; C-359 to V-375; S-360 to V-375; E-361 to V-375; 1-362 to . 
V-375; T-363 to V-375; P-364 to V-375; L-365 to V-375; Q-366 to V-375; Y-367 to V- 
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375; 1-368 to V-375; P-369 to V-375; D-370 to V-375; of SEQ ID NO:38. Polypeptides 
encoded by these polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example the 
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 25A-B, up to the glycine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-ml of Figures 25A-B, 
where ml is an integer from 6 to 375 corresponding to the position of the amino acid 
residue in Figures 25A-B. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of C- 
terminal deletions of the polypeptide of the invention shown as SEQ ID NO:38 include 
polypeptides comprising the amino acid sequence of residues: M-l to G-374; M-l to I- 
373; M-l to S-372; M-l to 1-371; M-l to D-370; M-l to P-369; M-l to 1-368; M-l to Y- 
367; M-l to Q-366; M-l to L-365; M-l to P-364; M-l to T-363; M-l to 1-362; M-l to E- 
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361; M-l to S-360; M-l to C-359; M-l to G-358; M-l to R-357; M-l to T-356; M-l to 
F-355; M-l to E-354; M-l to Q-353; M-l to P-352; M-l to S-351; M-l to G-350; M-l to 
D-349; M-l to M-348; M-l to D-347; M-l to V-346; M-l to W-345; M-l to Q-344; M-l 
to H-343; M-l to G-342; M-l to S-341; M-l to 1-340; M-l to M-339; M-l to G-338; M- 
5 1 to 1-337; M-l to 1-336; M-l to K-335; M-l to R-334; M-l to E-333; M-l to W-332; M- 
1 to K-331; M-l to Q-330; M-l to H-329; M-l to Q-328; M-l to R-327; M-l to K-326; 
M-l to W-325; M-l to M-324; M-l to R-323; M-l to V-322; M-l to Y-321; M-l to V- 
320; M-l toG-319; M-l to S-318; M-l to G-317; M-l to S-316; M-l to A-315; M-l to 
G-314: M-l to P-313; M-l to Q-312; M-l to S-311; M-l to D-310; M-l to C-309; M-l 

10 to Q-308; M-l to Q-307; M-l to Y-306; M-l to L-305; M-l to L-304; M-l to D-303; M- 
1 to Y-302; M-l to T-301; M-l to E-300; M-l to D-299; M-l to K-298; M-l to V-297; 
M-l to D-296; M-l to C-295; M-l to F-294; M-l to R-293; M-l to Y-292: M-l to V- 
291; M-l to L-290; M-l to N-289; M-l to G-288; M-l to P-287; M-l to R-286; M-l to 
D-285; M-l to N-284; M-l to D-283; M-l to Y-282; M-l to G-281; M-l to S-280; M-l 

15 to F-279; M-l to H-278; M-l to 1-277; M-l to R-276; M-l to G-275; M-l to G-274; M-l 
to P-273; M-l to L-272; M-l to Q-271; M-l to K-270; M-l to A-269; M-l to P-268; M-l 
to P-267; M-l to S-266; M-l to V-265; M-l to G-264; M-l to 1-263; M-l to K-262; M-l 
to M-261; M-l to F-260; M-l to K-259; M-l to R-258; M-l to K-257; M-l to H-256; M- 
1 to P-255; M-l to K-254; M-l to K-253; M-l to L-252; M-l to E-251; M-l to L-250; 

20 M-l to L-249; M-l to A-248; M-l to Y-247; M-l to D-246; M-l to Y-245; M- 1 to D- 
244; M-l to M-243; M-l to G-242; M-l to 1-241; M-l to D-240; M-l to N-239; M-l to 
A-238; M-l to N-237; M-l to G-236; M-l to K-235; M-l to 1-234; M-l to W-233; M-l 
to G-232; M-l to K-231; M-l to P-230; M-l to V-229; M l to H-228; M-l to T-227; M- 
1 to R-226; M-l to K-225; M-l to V-224; M-l to R-223; M-l to 1-222; M-l to W-221; 

25 M-l to Q-220; M-l to F-219; M-l to K-218; M-l to M-217; M-l to Q-216; M-l to E- 
215; M-l toP-214;M-l toM-213;M-l toA-212; M-l toS-211;M-I toT-210; M-l to 
S-209; M-l to D-208; M-l to N-207; M-l to A-206; M-l to G-205; M-l to R-204; M-l 
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to G-203; M-l to G-202; M-l to D-201; M-l to K-200; M-l to F-199; M-l to K-198; M- 
1 to P-197; M-l to K-196; M-l to L-195; M-l to F-194; M-l to G-193; M-l to V-192; 
M-l toR-191;M-l toL-190;M-l toK-189;M-l toQ-188;M-l toT-187;M-l toG- 
186; M-l to K-185; M-l to V-184; M-l to Y-183; M-l to T-182; M-l to K-181; M-l to 
G-180; M-l toD-179; M-l to H-178; M-l to 1-177; M-l to C-176; M-l to H-175; M-l to 
A-I74; M-l to A-173; M-l to T-172; M-l to L-171; M-l to V-170; M-l to H-169; M-l 
to K-168; M-l to E-167; M-l to A-166; M-l to V-165; M-l to L-164; M-l to T-163; M- 
1 toG-162;M-l toT-161;M-l toC-160;M-l toG-159; M-l toT-158;M-l toS-157; 
M-l to L-156; M-l to K-155; M-l to V-154; M-l to S-153; M-l to T-152; M-l to S-151 
M-l to F-150; M-l to P-149; M-l to Y-148; M-l to N-147; M-l to L-146; M-l to L-145 
M-l to F-144; M-l to D-143; M-l to K-142; M-l to G-141; M-l to F-140; M-l to I-I39 
M-l to S-138; M-l to F-137; M-l to R-136; M-l to S-135; M-l to D-134; M-l to Y-133 
M-l to G-132; M-l to Y-131; M-l to 1-130; M-l to Q-129; M-l to R-128; M-l to K-127 
M-l to R-126; M-l to R-125; M-l to S-124; M-l to K-123; M-l to G-122; M-l to S-121 
M-l to S-120; M-l to G-119; M-l to S-l 18; M-l to D-l 17; M-l to R-116; M-l to H- 
115; M-l toQ-114;M-I to A-113;M-1 toG-112;M-l toD-lll ; M-l toG-110;M-l to 
S-109; M-l to S-108; M-l to S-107; M-l to L-106; M-l to 1-105; M-l to Y-104; M-l to 
1-103; M-l to G-102; M-l to V-I01; M-l to Q-100; M-l to T-99; M-l to E-98; M-l toT- 
97; M-l to R-96; M-l to S-95; M-l to G-94; M-l to N-93; M-l to A-92; M-l to Y-91; 
M-l to L-90; M-l to T-89; M-l to E-88; M-l to Y-87; M-l to S-86; M-l to L-85; M-l to 
Y-84; M-l to Q-83; M-l to K-82; M-l to A-81; M-l to E-80; M-l to E-79; M-l to Y-78; 
M-l to T-77; M-l to P-76; M-l to L-75; M-l to P-74; M-l to T-73; M-l to G-72; M-l to 
K-71; M-l to H-70; M-l to C-69; M-l to Q-68; M-l to P-67; M-l to G-66; M-l to C-65; 
M-l to S-64; M-l to S-63; M-l to S-62; M-l to V-61; M-l to E-60; M-l to L-59; M-l to 
K-58; M-l to A-57; M-l to E-56; M-l to A-55; M-l to G-54; M-l to F-53; M-l to D-52; 
M-l to P-51; M-l to K-50; M-l to A-49; M-l to L-48; M-l to N-47; M-l to L-46; M-l 
to T-45; M-l to S-44; M-l to Q-43; M-l to P-42; M-l to L-41; M-l to V-40; M-l to V- 
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39; M-l to P-38; M-l to L-37; M-l to R-36; M-l to Y-35; M-l to A-34; M-l to P-33; M- 
1 to W-32; M-l to T-31; M-l to P-30; M-l to K-29; M-l to W-28; M-l to P-27; M-l" to 
A-26; M-l to S-25; M-l to Y-24; M-l to P-23; M-l to S-22; M-l to V-21; M-l to Q-20; 
M-l to G-19; M-l to V-18; M-l to A-17; M-l to C-16; M-l to L-15; M-l to L-14; M-l 
5 to F-13; M-l to F-12; M-l to L-l 1; M-l to L-10; M-l to F-9; M-l to L-8; M-l to L-7; 
M-l to G-6; of SEQ ID NO:38. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:20 which have been determined 
10 from the following related cDNA genes: HFKCF40F (SEQ ID NO: 128), HSRDF26R 
(SEQ ID NO:129), HTEBE07R (SEQ ID NO:130), HFTBP82R (SEQ ID NO:131), 
HAQBJ11R (SEQ ID NO:132), HAFBB1 1R (SEQ ID NO:133), HOEF085R (SEQ ID 
NO:134), and HUVGY95R (SEQ ID NO:135). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 12. 
1 5 Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 12. 

This gene is expressed primarily in endothelial cells, fibroblasts, smooth muscle, 
and osteoblasts, and to a lesser extent in brain, heart, placental tissues, lung, and many 
other tissues. Moreover, the transcript is present in HUVEC, HUVEC +LPS, smooth 
20 muscle, fibroblasts; present in heart, brain, placenta, lung, liver, muscle, kidney, 

pancreas, spleen, thymus, prostate, testes, ovary, small intestine, colon and weakly in 
PBLs. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
25 biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, disorders of vascularized tissues. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
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differential identification of the tissue(s) or cell type(s). For a number of disoixlers of the 
above tissues or cells, particularly of the vascular tissues, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g. vascular, skeletal, developmental, neural, cardiovascular, pulmonary, renal, 
immune, hematopoietic, reproductive, gastrointestinal, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, seminal, fluid, amniotic fluid, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 38 as residues: Pro-67 to Thr-73, Pro-76 to Gln-83, Asn-93 to 
Thr-99, His-115 to Arg-128, His-178 to Lys-189, Pro-197 to Ala-212, VaI-224 to Trp- 
233, Lys-253 to Lys-259, Ser-280 to Asn-289, Asp-296 to Tyr-302, Gln-308 to Ala-315, 
Arg-327 to Lys-335, Asp-349 to Gly-358. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in the vascularized endothelial cells indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the diagnosis 
and treatment of diseases of vascularized tissues, such as atherosclerosis, ataxia 
malabsortion, and hyperlipidemia. These and other factors often result in other 
cardiovascular disease. Furthermore, translation product of this gene is useful for the 
treatment of wounds, and may facilitate the wound healing process. Moreover, the 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Based upon the tissue distribution of this 
protein, antagonists directed against this protein is useful in blocking the activity of this 
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protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:20 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1685 of SEQ ID NO:20, b is an integer of 15 to 
1699, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:20, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 11 



The translation product of this gene shares sequence homology with Cytotoxic- 
Regulatory T-Cell Associated Molecule (CRTAM) protein, which is thought to be 
important in the regulation of celluar physiology, development, differentiation or 
function of various cell types, including haematopoietic cells and various T-cell 
progenitors. See for example, PCT publication WO 96/34102 incorporated herein by 
reference in its entirety. Moreover, the protein product of this gene also shares homology 
with the thymocyte activation and developmental protein and theclass-I MHC-restricted 
Tcell associated molecule (See Genbank Accession Nos. gi|2665790, gb|AAB88491.1, 
gb|AAC80267. 1 , and gi|3930163; all information and references contained within these 
accessions are hereby incorporated herein by reference). Based on the sequence 
similarity, the translation product of this gene is expected to share at least some 
biological activities with T-cell modulatory proteins. Such activities are known in the art, 
some of which are described elsewhere herein. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 

MASVVLPSGSQCAAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVI 

EGEVATISCQVr^SDDSVlQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSL 

TNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCT 

AMASKPATTIRWFKGNTELKGKSEVEEWSDMYTVTSQLMLKVHKEDDGVPVIC 

QVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQ 

PVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDY 

MLYVYDPPTTIPPP 1 1 1 1 1 1 1 llTnTr LTIITDSRAGEEGSIRAVDHAVIGGWAV 

VVFAMLCLLIILGRYFARHKGTYFTHEAKGADDAADADTAIINAEGGQNNSEEK 
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KEYH (SEQ ID NO: 136). Polynucleotides encoding these polypeptides are also 
provided. 

The polypeptide of this latter embodiment has been determined to have a 
transmembrane domain at about amino acid position 379 - 395 of the amino acid 
5 sequence referenced in Table XIII for this gene. Moreover, a cytoplasmic tail 

encompassing amino acids 396 to 442 of this protein has also been determined. Based 
upon these characteristics, it is believed that the protein product of this gene shares 
structural features to type la membrane proteins. 

Preferred polynucleotides comprise the following sequence: 
10 ATGGCGAGTGTAGTGC 

TGCCGAGCGGATCCCAGTGTGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCG 
GGCTCCGGCTCCGGCTTCTGCTGTTGCTCTTCTCCGCCGCGGCACTGATCCCC 
ACAGGTGATGGGCAGAATCTGTTTACGAAAGACGTGACAGTGATCGAGGGA 
GAGGTTGCGACCATCAGTTGCCAAGTCAATAAGAGTGACGACTCTGTGATTC 

15 AGCTACTGAATCCCAACAGGCAGACCATTTATTTCAGGGACTTCAGGCCTTT 
GAAGGACAGCAGGTTTCAGTTGCTGAATTTTTCTAGCAGTGAACTCAAAGTA 
TCATTGACAAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGCTCTA 
TACCGATCCCCCACAGGAAAGTTACACCACCATCACAGTCCTGGTCCCACCA 
CGTAATCTGATGATCGATATCCAGAAAGACACTGCGGTGGAAGGTGAGGAG 

20 ATTGAAGTCAACTGCACTGCTATGGCCAGCAAGCCAGCCACGACTATCAGGT 
GGTTCAAAGGGAACACAGAGCTAAAAGGCAAATCGGAGGTGGAAGAGTGGT 
CAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGAGG 
ACGATGGGGTCCCAGTGATCTGCCAGGTGGAGCACCCTGCGGTCACTGGAAA 
CCTGCAGACCCAGCGGTATCTAGAAGTACAGTATAAGCCTCAAGTGCACATT 

25 CAGATGACTTATCCTCTACAAGGCTTAACCCGGGAAGGGGACGCGCTTGAGT 
TAACATGTGAAGCCATCGGGAAGCCCCAGCCTGTGATGGTAACTTGGGTGAG 
AGTCGATGATGAAATGCCTCAACACGCCGTACTGTCTGGGCCCAACCTGTTC 
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ATCAATAACCTAAACAAAACAGATAATGGTACATACCGCTGTGAAGCTTCAA 
ACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGATCCCCC 
CACAACTATCCCTCCTCCCACAACAACCACCACCACCACCACCACCACCACC 
ACCACCATCCTTACCATCATCACAGATTCCCGAGCAGGTGAAGAAGGCTCGA 
5 TCAGGGCAGTGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTT 
CGCCATGCTGTGCTTGCTCATCATTCTGGGGCGCTATTTTGCCAGACATAAAG 
GTACATACTTCACTCATGAAGCCAAAGGAGCCGATGACGCAGCAGACGCAG 
ACACAGCTATAATCAATGCAGAAGGAGGACAGAACAACTCCGAAGAAAAGA 
AAGAGTACTTCATCTAG (SEQ ID NO:137). Also preferred are the polypeptides 

10 encoded by these polynucleotides. 

Figures 28A-B shows the nucleotide (SEQ ID NO:21) and deduced amino acid 
sequence (SEQ ID NO:39) of the present invention. Predicted amino acids from about I 
to about 44 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 44 in SEQ ID NO:39) and are represented by the underlined amino acid regions. 

1 5 Figure 29 shows the regions of similarity between the amino acid sequences of 

the present invention SEQ ID NO:39, the human poliovirus receptor protein (gi| 1524088) 
(SEQ ID NO: 138), the human class-I MHC-restricted T cell associated molecule 
(WO9634102) (SEQ ID NO: 144), and the Callus gallus thymocyte activation and 
developmental protein (gb|AAB88491.1) (SEQ ID NO: 145). 

20 Figure 30 shows an analysis of the amino acid sequence of SEQ ID NO:39. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 

25 Figures 28A-B (SEQ ID NO:39), which was determined by sequencing a cloned cDNA. 
The nucleotide sequence shown in Figures 28A-B (SEQ ID NO:21) was obtained by 
sequencing a cloned cDNA (HOUDJ81), which was deposited on Nov. 17, 1998 at the 
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American Type Culture Collection, and given Accession Number 203484. The deposited 
gene is inserted in the pSpoit plasmid (Life Technologies, Rockville. MD) using the 
Sall/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:21 is intended DNA fragments at least about 15nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which arc useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:2l. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO:21. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 551 to about 
600. from about 601 to about 650, from about 651 to about 700. from about 701 to about 
750. from about 751 to about 800, from about 801 to about 850. from about 851 to about 
900. from about 901 to about 950, from about 951 to about 1000. from about 1001 to 
about 1050, from about 1051 to about 1 100, from about 1101 to about 1150, from about 
1 151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300 
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from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 to 
about 1450, from about 1451 to about 1500, from about 1501 to about 1520 of SEQ ID 
NO:21, or the complementary strand thereto, or the cDNA contained in the deposited 
gene. In this context "about" includes the particularly recited ranges, larger or smaller by 
5 several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In additional 
embodiments, the polynucleotides of the invention encode functional attributes of the 
corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
10 beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 30 and/or Table X, 
15 as described above, was generated using the various modules and algorithms of the 

DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table X can be used to determine regions of the 
prote.n which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
20 XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 30, but may, as 
shown in Table X, be represented or identified by using tabular representations of the 
25 data presented in Figure 30. The DNA*STAR computer algorithm used to generate 
Figure 30 (set on the original default parameters) was used to present the data in Figure 
30 in a tabular format (See Table X). The tabular format of the data in Figure 30 is used 
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to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 30 and in Table X include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 28A-B. As set out in Figure 30 and in Table X, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 28A-B, up to the threonine residue at position number 359 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n 1-364 of Figures 28A-B, where nl is an 
integer from 2 to 359 corresponding to the position of the amino acid residue in Figures 
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28A-B (which is identical to the sequence shown as SEQ ID NO:39). N-teiminal 
deletions of the polypeptide of the invention shown as SEQ ID NO:39 include 
polypeptides comprising the amino acid sequence of residues: A-2 to R-364; S-3 to R- 
364; V-4 to R-364; V-5 to R-364; L-6 to R-364; P-7 to R-364; S-8 to R-364; G-9 to R- 
364; S-10 to R-364; Q-l 1 to R-364; C-12 to R-364; A-13 to R-364; A-14 to R-364; A-15 
to R-364; A-16 to R-364; A-17 to R-364; A-18 to R-364; A-19 to R-364; A-20 to R-364; 
P-21 to R-364; P-22 to R-364; G-23 to R-364; L-24 to R-364; R-25 to R-364; L-26 to R- 
364; R-27 to R-364; L-28 to R-364; L-29 to R-364; L-30 to R-364; L-31 to R-364; L-32 
to R-364; F-33 to R-364; S-34 to R-364; A-35 to R-364; A-36 to R-364; A-37 to R-364; 
L-38 to R-364; 1-39 to R-364; P-40 to R-364; T-41 to R-364; G-42 to R-364; D-43 to R- 
364; G^4 to R-364; Q-45 to R-364; N-46 to R-364; L-47 to R-364; F-48 to R-364; T-49 
to R-364; K-50 to R-364; D-5 1 to R-364; V-52 to R-364; T-53 to R-364; V-54 to R-364; 
1-55 to R-364; E-56 to R-364; G-57 to R-364; E-58 to R-364; V-59 to R-364; A-60 to R- 
364; T-61 to R-364; 1-62 to R-364; S-63 to R-364; C-64 to R-364; Q-65 to R-364; V-66 
to R-364; N-67 to R-364; K-68 to R-364; S-69 to R-364; D-70 to R-364; D-71 to R-364; 
S-72 to R-364; V-73 to R-364; 1-74 to R-364; Q-75 to R-364; L-76 to R-364; L-77 to R- 
364; N-78 to R-364; P-79 to R-364; N-80 to R-364; R-81 to R-364; Q-82 to R-364; T-83 
to R-364; 1-84 to R-364; Y-85 to R-364; F-86 to R-364; R-87 to R-364; D-88 to R-364; 
F-89 to R-364; R-90 to R-364; P-91 to R-364; L-92 to R-364; K-93 to R-364; D-94 to R- 
364; S-95 to R-364; R-96 to R-364; F-97 to R-364; Q-98 to R-364; L-99 to R-364; L-100 
to R-364; N-101 to R-364; F-102 to R-364; S-103 to R-364; S-104 to R-364; S-105 to R- 
364; E-106 to R-364; L-107 to R-364; K-108 to R-364; V-109 to R-364; S-l 10 to R-364; 
L-l 1 1 to R-364; T-l 12 to R-364; N-l 13 to R-364; V-l 14 to R-364; S-l 15 to R-364; I- 
1 16 to R-364; S-l 17 to R-364; D-118 to R-364; E-l 19 to R-364; G-120 to R-364; R-121 
to R-364; Y-122 to R-364; F-123 to R-364; C-124 to R-364; Q-125 to R-364; L-126 to 
R-364; Y-127 to R-364; T-128 to R-364; D-129 to R-364; P-130 to R-364; P-131 to R- 
364; Q-132 to R-364; E-133 to R-364; S-134 to R-364; Y-135 to R-364; T-136 to R-364; 
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T-137 to R-364; 1-138 to R-364; T-139 to R-364; V-140 to R-364; L-141 to R-364; V- 
142 to R-364; P-143 to R-364; P-144 to R-364; R-145 to R-364; N-146 to R-364; L-147 
to R-364; M-148 to R-364; 1-149 to R-364; D-150 to R-364; 1-151 to R-364; Q-152 to R- 
364; K-153 to R-364; D-154 to R-364; T-155 to R-364; A-156 to R-364; V-157 to R- 
5 364; E-158 to R-364; G-159 to R-364; E-160 to R-364; E-161 to R-364; 1-162 to R-364; 
E-163 to R-364; V-164 to R-364; N-165 to R-364; C-166 to R-364; T-167 to R-364; A- 
168 to R-364; M-169 to R-364; A-170 to R-364; S-171 to R-364; K-172 to R-364; P-173 
to R-364; A-174 to R-364; T-175 to R-364; T-176 to R-364; 1-177 to R-364; R-178 to R- 
364; W-179 to R-364; F-180 to R-364; K-181 to R-364; G-182 to R-364; N-183 to R- 

10 364; T-184 to R-364; E-185 to R-364; L-186 to R-364; K-I87 to R-364; G-188 to R-364; 
K-189 to R-364; S-190 to R-364; E-191 to R-364; V-192 to R-364; E-193 to R-364; E- 
194 to R-364; W-195 to R-364; S-196 to R-364; D-197 to R-364; M-I98 to R-364; Y- 
199 to R-364; T-200 to R-364; V-201 to R-364; T-202 to R-364; S-203 to R-364; Q-204 
to R-364; L-205 to R-364; M-206 to R-364; L-207 to R-364; K-208 to R-364; V-209 to 

15 R-364; H-210 to R-364; K-21 1 to R-364; E-212 to R-364; D-213 to R-364; D-214 to R- 
364; G-215 to R-364; V-216 to R-364; P-217 to R-364; V-218 to R-364; 1-219 to R-364; 
C-220 to R-364; Q-221 to R-364; V-222 to R-364; E-223 to R-364; H-224 to R-364; P- 
225 to R-364; A-226 to R-364; V-227 to R-364; T-228 to R-364; G-229 to R-364; N-230 
to R-364; L-231 to R-364; Q-232 to R-364; T-233 to R-364; Q-234 to R-364; R-235 to 

20 R-364; Y-236 to R-364; L-237 to R-364; E-238 to R-364; V-239 to R-364; Q-240 to R- 
364; Y-241 to R-364; K-242 to R-364; P-243 to R-364; Q-244 to R-364; V-245 to R- 
364; H-246 to R-364; 1-247 to R-364; Q-248 to R-364; M-249 to R-364; T-250 to R-364; 
Y-251 to R-364; P-252 to R-364; L-253 to R-364; Q-254 to R-364; G-255 to R-364; L- 
256 to R-364; T-257 to R-364; R-258 to R-364; E-259 to R-364; G-260 to R-364; D-261 

25 to R-364; A-262 to R-364; L-263 to R-364; E-264 to R-364; L-265 to R-364; T-266 to 
R-364; C-267 to R-364; E-268 to R-364; A-269 to R-364; 1-270 to R-364; G-271 to R- 
364; K-272 to R-364; P-273 to R-364; Q-274 to R-364; P-275 to R-364; V-276 to R-364; 
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M-277 to R-364; V-278 to R-364; T-279 to R-364; W-280 to R-364; V-28 1 to R-364; R- 
282 to R-364; V-283 to R-364; D-284 to R-364; D-285 to R-364; E-286 to R-364; M- 
287 to R-364; P-288 to R-364; Q-289 to R-364; H-290 to R-364; A-291 to R-364; V-292 
to R-364; L-293 to R-364; S-294 to R-364; G-295 to R-364; P-296 to R-364; N-297 to 
R-364; L-298 to R-364; F-299 to R-364; 1-300 to R-364; N-301 to R-364; N-302 to R- 
364; L-303 to R-364; N-304 to R-364; K-305 to R-364; T-306 to R-364; D-307 to R- 
364; N-308 to R-364; G-309 to R-364; T-310 to R-364; Y-311 to R-364; R-312 toR- 
364; C-313 to R-364; E-314 to R-364; A-315 to R-364; S-316 to R-364; N-317 to R-364; 
1-318 to R-364; V-319 to R-364; G-320 to R-364; K-321 to R-364; A-322 to R-364; H- 
323 to R-364; S-324 to R-364; D-325 to R-364; Y-326 to R-364; M-327 to R-364; L-328 
to R-364; Y-329 to R-364; V-330 to R-364; Y-331 to R-364; D-332 to R-364; P-333 to 
R-364; P-334 to R-364; T-335 to R-364; T-336 to R-364; 1-337 to R-364; P-338 to R- 
364; P-339 to R-364; P-340 to R-364; T-341 to R-364; T-342 to R-364; T-343 to R-364; 
T-344 to R-364; T-345 to R-364; T-346 to R-364; T-347 to R-364; T-348 to R-364: T- 
349 to R-364; T-350 to R-364; T-351 to R-364; T-352 to R-364; T-353 to R-364; 1-354 
to R-364; L-355 to R-364; T-356 to R-364; 1-357 to R-364; 1-358 to R-364; T-359 to R- 
364; of SEQ ID NO:39. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example the 
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-termina! residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
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and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 28A-B, up to the leucine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues I-ml of Figures 28A-B, 
where ml is an integer from 6 to 364 corresponding to the position of the amino acid 
residue in Figures 28A-B. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of C- 
terminal deletions of the polypeptide of the invention shown as SEQ ID NO:39 include 
polypeptides comprising the amino acid sequence of residues: M-l to A-363; M-l to R- 
362; M-l to S-361; M-l to D-360; M-l to T-359; M-l to 1-358; M-l to 1-357; M-l to T- 
356; M-l to L-355; M-l to 1-354; M-l to T-353; M-l to T-352; M-l to T-351; M-l to T- 
350; M-l to T-349; M-l to T-348; M-l to T-347; M-l to T-346; M-l to T-345; M-l to T- 
344; M-l toT-343; M-l to T-342; M-l to T-341; M-l to P-340; M-l to P-339; M-l to P- 
338; M-l to 1-337; M-l to T-336; M-l to T-335; M-l to P-334; M-l to P-333; M-l to D- 
332; M-l to Y-331; M-l to V-330; M-l to Y-329; M-l to L-328; M-l to M-327; M-l to 
Y-326; M-l to D-325; M-l to S-324; M-l to H-323; M-l to A-322; M-l to K-321; M-l 
to G-320; M-l to V-319; M-l to 1-318; M-l to N-317; M-l to S-316; M-l to A-315; M-l 
to E-314; M-l to C-313; M-l to R-312; M-l to Y-31 1; M-l to T-310; M-l to G-309; M- 
1 to N-308; M-l to D-307; M-l to T-306; M-l to K-305; M-l to N-304; M-l to L-303; 
M-l to N-302; M-l to N-301; M-l to 1-300; M-l to F-299; M-l to L-298; M-l to N-297; 
M-l to P-296; M-l to G-295; M-l to S-294; M-l to L-293; M-l to V-292; M-l to A-291; 
M-l to H-290; M-l to Q-289; M-l to P-288; M-l to M-287; M-l to E-286; M-l to D- 
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285; M-l to D-284; M-l to V-283; M-l to R-282; M-l to V-281; M-l to W-280; M-l to 
T-279; M-l to V-278; M-l to M-277; M-l to V-276; M-l to P-275; M-l to Q-274; M-l 
to P-273; M-l to K-272; M-l to G-271; M-l to 1-270; M-l to A-269; M-l to E-268; M-l 
to C-267; M-l to T-266; M-l to L-265; M-l to E-264; M-l to L-263; M-l to A-262; M-l 
to D-261; M-l to G-260; M-l to E-259; M-l to R-258; M-l to T-257; M-l to L-256; M- 
1 to G-255; M-l to Q-254; M-l to L-253; M-l to P-252; M-l to Y-251 ; M-l to T-250; 
M-i to M-249; M-l to Q-248; M-l to 1-247; M-l to H-246; M-l to V-245; M-l to Q- 
244; M-l to P-243; M-l to K-242; M-l to Y-241; M-l to Q-240; M-l ,o V-239; M-l to 
E-238; M-l to L-237; M-l to Y-236; M-l to R-235; M-l to Q-234; M-l toT-233; M-l 
to Q-232; M-l to L-231: M-l to N-230; M-l to G-229; M-l to T-228; M-l to V-227; M- 
1 to A-226; M-l to P-225; M-I to H-224; M-l to E-223; M-l to V-222; M-l to Q-221; 
M-l to C-220; M-l to 1-219; M-l to V-218; M-l to P-217: M-l to V-216; M-l to G-215; 
M-l toD-214;M-l toD-213;M-l toE-212;M-l toK-211;M-l toH-210;M-l to V- 
209; M-l to K-208; M-l to L-207; M-l to M-206; M-l to L-205; M-l to Q-204; M-l to 
S-203; M-l to T-202; M-l to V-201; M-l to T-200; M-l to Y-199; M-l to M-198; M-l 
to D-197; M-l to S-196; M-l to W-195; M-l to E-194; M-l to E-193; M-l to V-192; M- 
1 to E-191; M-l to S-190; M-l to K-189; M-l to G-188; M-l to K-187; M-l to L-186; 
M-l to E-185; M-l to T-184; M-l to N-183; M-l to G-182; M-l to K-181; M-l to F- 
180; M-l to W-179; M-l to R-178; M-l to 1-177; M-l to T-176; M-l to T-175; M-l to 
A-174; M-l to P-173: M-l to K-172; M-l to S-171; M-l to A-170; M-l to M-169; M-l 
to A-168; M-l to T-167; M-l to C-166; M-l to N-165; M-l to V-164; M-l to E-163; M- 
1 to 1-162; M-l to E-161; M-l to E-160; M-l to G-159; M-l to E-158; M-l to V-157; M- 
1 to A-156; M-l toT-155; M-l to D-154; M-l to K-I53; M-l to Q-152; M-l to 1-151; 
M-l to D-150; M-l to 1-149; M-l to M-148; M-l to L-147; M-l to N-146; M-l to R- 
145; M-l toP-144;M-l toP-143;M-l to V-142;M-1 toL-141;M-l to V-140;M-1 to 
T-139; M-l to 1-138; M-l to T-137; M-l to T-136; M-l to Y-135; M-l to S-134; M-l to 
E-133;M-1 toQ-132;M-l toP-131;M-l toP-130;M-l toD-129;M-l toT-128;M-l to 
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Y-127; M-l to L-126; M-l to Q-125; M-l to C-124; M-l to F-123; M-l to Y-122; M-l 
to R-121; M-l to G-120; M-l to E-119; M-l to D-l 18; M-l to S-117; M-l to 1-116; M-l 
toS-115; M-l to V-114; M-l toN-113; M-l toT-112; M-l toL-111; M-l to S-l 10; M-l 
to V-109; M-l to K-108; M-l to L-107; M-l to E-106; M-l to S-105; M-l to S-104; M-l 
5 to S-103; M-l to F-102; M-l to N-101; M-l to L-100; M-l to L-99; M-l to Q-98; M-l to 
F-97; M-l to R-96; M-l to S-95; M-l to D-94; M-l to K-93; M-l to L-92; M-l to P-91; 
M-l to R-90; M-l to F-89; M-l to D-88; M-l to R-87; M-l to F-86; M-l to Y-85; M-l to 
1-84; M-l to T-83; M-l to Q-82; M-l to R-81; M-l to N-80; M-l to P-79; M-l to N-78; 
M-l to L-77; M-l to L-76; M-l to Q-75; M-l to 1-74; M-l to V-73; M-l to S-72; M-l to 

10 D-71; M-l to D-70; M-l to S-69: M-l to K-68; M-l to N-67; M-l to V-66; M-l to Q-65; 
M-l to C-64; M-l to S-63; M-l to 1-62; M-l to T-61; M-l to A-60; M-l to V-59; M-l to 
E-58; M-l to G-57; M-l to E-56; M-l to 1-55; M-l to V-54; M-l to T-53; M-l to V-52; 
M-l to D-51; M-l to K-50; M-l to T-49; M-l to F-48; M-l to L-47; M-l to N-46; M-l 
to Q-45; M-l to G-44; M-l to D-43; M-l to G-42; M-l to T-41; M-l to P-40; M-l to I- 

15 39; M-l to L-38; M-l to A-37; M-l to A-36; M-l to A-35; M-l to S-34; M-l to F-33; M- 
1 to L-32; M-l to L-31; M-l to L-30; M-l to L-29; M-l to L-28; M-l to R-27; M-l to L- 
26; M-l to R-25; M-l to L-24; M-l to G-23; M-l to P-22; M-l to P-21; M-l to A-20; M- 
1 to A-19; M-l to A-18; M-l to A-17; M-l to A-16; M-l to A-15; M-l to A-14; M-l to 
A-13; M-l to C-12; M-l to Q-l 1; M-l to S-10; M-l to G-9; M-l to S-8; M-l to P-7; M-l 

20 to L-6; of SEQ ID NO:39. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:21 which have been determined 
from the following related cDNA genes: HSQFJ92R (SEQ ID NO: 139), HFLAB18F 

25 (SEQ ID NO:140), HAQBH82R (SEQ ID NO:141), HLHTM10R (SEQ ID NO:142), 
and HLHAL65R (SEQ ID NO: 143). 
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This gene is expressed primarily in immune system related tissues such as 
ulcerative colitis, rejected kidney tissues, and to a lesser extent in thymus and bone 
marrow. Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, immune and hematopoietic diseases and/or disorders, particularly ulcerative 
colitis and rejected organs. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g. transplanted kidney, 
immune, hematopoeitic, renal, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 39 as residues: Gly-42 to Phe-48, Val-66 to Asp-71 , Asn-78 to 
Thr-83, Asp-88 to Arg-96, Tyr-127 toTyr-135, Lys-181 toTrp-195, His-210 to Gly-215, 
Leu-303 to Thr-310, Thr-341 to Thr-350. Polynucleotides encoding said polypeptides are 
also provided. 

The tissue distribution primarily in immune cells and tissues, combined with the 
homology to the CRTAM, thymocyte activation and developmental protein, the class-I 
MHC-restricted T cell associated molecule protein, and the polivirus receptor, indicates 
that the protein products of this gene are useful for the regulation of celluar physiology, 
development, differentiation or function of various cell types, including haematopoietic 
cells and particularly T-cell progenitors. Representative uses are described in the 
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"Immune Activity" and "infectious disease" sections below, in Example 11, 13, 14, 16, 
18, 19, 20, and 27, and elsewhere herein. The proteins can be used to develop products 
for the diagnosis and treatment of conditions associated with abnormal physiology or 
development, including abnormal proliferation, e.g. cancers, or degenerative conditions. 

5 The physiology or development of a cell can be modulated by contacting the cell with an 
agonist or antagonist (i.e. an anti- CRTAM-like peptide antibody). Further the CRTAM- 
like polypeptides of the present invention include treatment of ulcerative colitis, organ 
rejection and other immune system related disorders. Agonists or antagonists may treat 
or prevent such disorders as ulcerative colitis and rejected organs, such as kidney. Based 

10 upon the tissue distribution of this protein, antagonists directed against this protein is 
useful in blocking the activity of this protein. Accordingly, preferred are antibodies 
which specifically bind a portion of the translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 

15 product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 

20 above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement Protein, as well as, antibodies directed 

25 against the protein may show utility as a tumor marker and/or immunotherapy targets for 
the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:21 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
b, where a is any integer between 1 to 1506 of SEQ ID NO:21, b is an integer of 15 to 
1520, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:21, and where b is greater than or equal to a + 14. 



a- 



FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

Figure 31 shows the nucleotide (SEQ ID NO:22) and deduced amino acid 
sequence (SEQ ID NO:40) of the present invention. Predicted amino acids from about 1 
to about 23 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 23 in SEQ ID NO:40) and are represented by the underlined amino acid regions. 

Figure 32 shows the regions of similarity between the amino acid sequences of 
the present invention SEQ ID NO:40 and the human FAP protein (gi|1890647) (SEQ ID 
NO: 146). 

Figure 33 shows an analysis of the amino acid sequence of SEQ ID NO:40. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figure 31 (SEQ ID NO:40), which was determined by sequencing a cloned cDNA. The 
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nucleotide sequence shown in Figure 31 (SEQ ID NO:22) was obtained by sequencing a 
cloned cDNA (HPWCM76), which was deposited on Nov. 17, 1998 at the American 
Type Culture Collection, and given Accession Number 203484. The deposited gene is 
inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the Sall/NotI 
5 restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:22 is intended DNA fragments at least about 15nt, and more preferably at least about 

10 20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:22. By a fragment at least 20 nt in 

15 length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO:22. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 

20 for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 551 to about 

25 600, from about 601 to about 650, from about 65 1 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 807 of SEQ ID NO:22, or the 
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complementary strand thereto, or the cDNA contained in the deposited gene. In this 
context "about" includes the particularly recited ranges, larger or smaller 

by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In 
additional embodiments, the polynucleotides of the invention encode functional 
attributes of the corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 33 and/or Table XI, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XI V of Table XI can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 33, but may, as 
shown in Table XI, be represented or identified by using tabular representations of the 
data presented in Figure 33. The DNA*STAR computer algorithm used to generate 
Figure 33 (set on the original default parameters) was used to present the data in Figure 
33 in a tabular format (See Table XI). The tabular format of the data in Figure 33 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 33 and in Table XI include, but are not limited to, 
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regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figure 31. As set out in Figure 33 and in Table XI, such preferred regions include 
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou- 
Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions 
5 and Hopp- Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, 
Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index and 
Emini surface-forming regions. Even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 

10 multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 

15 immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

20 Accordingly, the present invention further provides polypeptides having one or 

more residues deleted from the amino terminus of the amino acid sequence shown in 
Figure 31, up to the arginine residue at position number 61 and polynucleotides encoding 
such polypeptides. In particular, the present invention provides polypeptides comprising 
the amino acid sequence of residues nl-66 of Figure 31, where nl is an integer from 2 to 

25 61 corresponding to the position of the amino acid residue in Figure 31 (which is 
identical to the sequence shown as SEQ ID NO:40). N-terminal deletions of the 
polypeptide of the invention shown as SEQ ID NO:40 include polypeptides comprising 
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the amino acid sequence of residues: S-2 to N-66; S-3 to N-66; S-4 to N-66; S-5 to N-66; 
L-6 to N-66; K-7 to N-66; H-8 to N-66; L-9 to N-66; L-10 to N-66; C-l 1 to N-66; M-12 
to N-66; A-13 to N-66; L-14 to N-66; S-15 to N-66; W-16 to N-66; F-17 to N-66; S-18 
to N-66; S-19 to N-66; F-20 to N-66; 1-21 to N-66; S-22 to N-66; G-23 to N-66; E-24 to 
N-66; T-25 to N-66; S-26 to N-66; F-27 to N-66; S-28 to N-66; L-29 to N-66; L-30 to N- 
66; N-31 to N-66; S-32 to N-66; F-33 to N-66; F-34 to N-66; L-35 to N-66; P-36 to N- 
66; Y-37 to N-66; P-38 to N-66; S-39 to N-66; S-40 to N-66; R-41 to N-66; C-42 to N- 
66; C-43 to N-66; C-44 to N-66; F-45 to N-66; S^6 to N-66; V-47 to N-66; Q-48 to N- 
66; C-49 to N-66; S-50 to N-66; 1-51 to N-66; L-52 to N-66; D-53 to N-66; P-54 to N- 
66; F-55 to N-66; S-56 to N-66: C-57 to N-66; N-58 to N-66; S-59 to N-66; M-60 to N- 
66; R-61 to N-66; of SEQ ID NO:40. Polypeptides encoded by these polynucleotides are 
also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example the 
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
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polypeptide shown in Figure 31, up to the leucine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-ml of Figure 31, where 
ml is an integer from 6 to 66 corresponding to the position of the amino acid residue in 
5 Figure 3 1 . Moreover, the invention provides polynucleotides encoding polypeptides 
comprising, or alternatively consisting of, the amino acid sequence of C-terminal 
deletions of the polypeptide of the invention shown as SEQ ID NO:40 include 
polypeptides comprising the amino acid sequence of residues: M-l to E-65; M-l to W- 
64; M-l to P-63; M-l to F-62; M-l to R-61; M-l to M-60; M-l to S-59; M-l to N-58; 

10 M-l to C-57; M-l to S-56; M-l to F-55; M-l to P-54; M-l to D-53; M-l to L-52; M-l to 
1-51; M-l to S-50; M-l to C-49; M-l to Q-48; M-l to V-47; M-l to S-46; M-l to F-45; 
M-l to C-44; M-l to C-43; M-l to C-42; M-l to R-41; M-l to S-40; M-l to S-39; M-l to 
P-38; M-l to Y-37; M-l to P-36; M-l to L-35; M-l to F-34; M-l to F-33; M-l to S-32; 
M-l to N-31; M-I to L-30; M-l to L-29; M-l to S-28; M-l to F-27; M-l to S-26; M-l to 

15 T-25; M-l to E-24; M-l to G-23; M-l to S-22; M-l to 1-21; M-l to F-20; M-l to S-19; 
M-l to S-18; M-l to F-17; M-l to W-16; M-l to S-15; M-l to L-14; M-l to A-13; M-l 
to M-12; M-l to C-l 1; M-l to L-10; M-l to L-9; M-l to H-8; M-l to K-7; M-l to L-6; of 
SEQ ID NO:40. Polypeptides encoded by these polynucleotides are also encompassed by 
the invention. 

20 In addition, the invention provides nucleic acid molecules having nucleotide 

sequences related to extensive portions of SEQ ID NO:22 which have been determined 
from the following related cDNA genes: HPWCM76R (SEQ ID NO: 147). 

This gene is expressed primarily in prostate BPH (benign prostatic hyperplasia) 
tissue. Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, inflammation of the prostate, or related tissues. Similarly, polypeptides and 
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antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the prostate, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g. prostate, cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in prostate BPH tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment of inflammatory 
conditions which result in an enlargement of the prostate, or related tissues. 
Polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
and diagnosis of conditions concerning proper testicular function (e.g. endocrine 
function, sperm maturation), as well as cancer. Therefore, this gene product is useful in 
the treatment of male infertility and/or impotence. This gene product is also useful in 
assays designed to identify binding agents, as such agents (antagonists) are useful as 
male contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene expression 
of transcripts that is expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product is expressed in other specific tissues or organs where it may 
play related functional roles in other processes, such as hematopoiesis, inflammation, 
bone formation, and kidney function, to name a few possible target indications. Based 
upon the tissue distribution of this protein, antagonists directed against this protein is 
useful in blocking the activity of this protein. Accordingly, preferred are antibodies 
which specifically bind a portion of the translation product of this gene. 
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Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
5 for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. The homology to the FAP protein 

10 indicates that the protein product of this gene is useful in treating, detecting, and/or 
preventing iron metabolism disorders, particularly those resulting in high oxidative 
states, tissue damage, athersclerosis, free radical damage, vascular disorders, iron binding 
protein dysfunction, nitric oxide synthase dysfunction or aberration, vasodilation 
disorders, and tissue edema. Based on the sequence similarity, the translation product of 

15 this gene is expected to share at least some biological activities with iron metabolism 
modulatory proteins. Such activities are known in the art, some of which are described 
elsewhere herein. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 

20 supplement. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:22 and may have been publicly available prior to conception of the present 

25 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
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polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 793 of SEQ ID NO:22, b is an integer of 15 to 
807, where both a and b correspond to the positions of nucleotide residues shown in SEQ 
ID NO:22, and where b is greater than or equal to a + 14. 
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F 


3.40 6.80 


T 




1.24 


-1.44 


* * 


F 


3.06 2.43 


T 




1.40 


-1.01 


* * 


F 


2.72 2.61 






1.40 


-0.71 


* 


F 


1.78 1.13 






1.50 


-0,71 


* * 


F 


1.44 1.65 






0.64 


-0.67 


* * 




0.95 128 






0.36 


0.01 


* * 




-0.10 0.54 






0.36 


0.51 


* * 


- 


-0.60 . 0.77 






-0.07 


-0.17 


* * 


F 


0.60 1.00 






-0.03 


-0.11 


* * 


F 


0.60 1.67 






-0.63 


-0.26 


* * 


F 


0.45 0.63 






0.07 


0.43 


* * 


F 


-0.45 0.34 






-0.86 


0.00 






030 0.68 






-1.46 


0.00 






0.30 0.56 






-0.60 


0.19 






-0.30 0.23 






-0.01 


0.19 


* 




-0.30 032 






-0.06 


-0.30 


• 




0.30 0.74 






-0.04 


-0.54 




F 


0.90 1.56 






0.81 


-0.84 




F 


0.90 1.25 






1.67 


-0.84 




F 


0.90 2.63 






1.08 


-1.53 




F 


. 0.90 638 






1.30 


-1.71 




F 


0.90 3.22 






2.22 


-1.21 




F 


0.90 1.88 






2.22 


-1.21 




F 


0.90 2.13 






1.92 


-1.61 


* 


F 


0.90 3.17 
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Ala 


449 


A 


A 


• 




• 




1.62 


-1.61 






F 


0.90 2.46 


Arg 


450 


A . 


A 


• 




• 




1.62 


-1.23 


* 




F 


0.90 1.65 


Lys 


451 


A 


A 


• 




• 




1.03 


-0.84 


* 




F 


0.90 2.18 


Ser 


452 


A 




• 




T 




0.68 


-0.84 


* 




F 


1.30 2.18 


Ser 


453 


A 








T 




0.46 


-0.86 






F 


130 1.61 


Ser 


454 






B 




T 




0.46 


-0.17 






F 


0.85 0.66 


Ala 


455 






B 




T 




0.04 


0.33 






F 


0.25 0.50 


Thr 


456 






B 








-0.34 


0.33 








-0.10 0.50 


Leu 


457 






B 








-0.26 


0.37 








-0.10 037 


Ala 


458 






B 




T 




-0.54 


0.41 






F 


-0.05 0.57 


Ser 


459 






B 




T 




-0.24 


0.41 






F 


-0.05 0.40 


Gly 


460 






• 




T 


C 


0.00 


033 






F 


0.45 0.83 


Pro 


461 










T 


C 


-0.50 


0.07 


* 




F 


0.45 0.81 


Ala 


462 






* 




* 


c 


0.01 


0.26 


* 




F 


0.25 0.50 


Gin 


463 


A 












0.60 


0.26 






F 


0.05 0.68 


Gly 


464 






B 








0.94 


-0.17 






F 


0.65 0.76 


Leu 


465 






B 








0.94 


-0.60 






F 


1.10 1.50 


Ser 


466 


A 












0.86 


-0.67 


* 




F 


0.95 0.86 


Glu 


467 


A 












1.14 


-0.69 






F 


1.101.16 


Lys 


468 


A 












1.19 


-0.73 






F 


1.10 L89 


Gly 


469 


A 






T 


T 




1.58 


-1.41 






F 


1.70 2.82 


Ser 


470 


A 








T 




1.58 


-1.80 






F 


1.30 3.26 


Ser 


471 


A 








T 




1.29 


-1.11 






F 


130 134 


Lys 


472 


A 








T 




0.99 


-0.61 






F 


1.30137 


Lys 


473 






B 








0.73 


-0.66 






F 


1.10 137 


Leu 


474 






B 








0.77 


-0.61 






F 


1.10 1.58 


Ala 


475 






B 




. 




0.77 


-0.51 


* 




F 


1.40 1.14 


Ser 


476 






B 




T 




0.77 


-0.13 






F 


1.45 0.77 


Pro 


477 






B 


* 


T 




0.77 


0.26 






F 


130 124 


Thr 


478 








T 


T 




0.72 


-0.43 






F 


2.60 2.46 


Ser 


479 










T 


c 


122 


-0.53 






F 


3.00 2.95 


Ser 


480 








T 


T 




1.00 


-0.43 


* 


* 


F 


2.60 2.76 


Lys 


481 






B 




T 




1.41 


-0.17 


* 


* 


F 


1.90 1.58 


Asn 


482 






B 




T 




1.28 


-0.66 


* 


* 


F 


1.90 230 


Thr 


483 






B 




T 




138 


-0.61 


• 


* 


F 


1.94 1.70 


Leu 


484 






B 








133 


-0.57 


* 


* 


F 


1.78 132 


Arg 


485 






B 








132 


-0.14 


* 


* 


F 


1.67 0.81 


Gly 


486 






B 




T 




132 


-0.06 




* 


F 


2.21 0.81 


Pro 


487 








T 


T 




1.37 


-0.54 




* 


F 


3.40 1.96 


Gly 


488 








T 


T 




1.72 


-1.23 




* 


F 


3.06 2.00 


Thr 


489 










T 


c 


1.94 


-1.23 




* 


F 


2.52 4.05 
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Lys 


490 




A 


B 






Lys 


491 




A 


B 






Lys 


492 




A 


B 






Ala 


493 




A 


B 




5 


Arg 


494 




A 


B 






Val 


495 




A 


B 






Gly 


496 






B 






Pro 


497 






B 






His 


498 






B 




10 


Val 


499 






B 






Arg 


500 






B 






Gin 


501 






B 






Gly 


502 






- 


T 




Lys 


503 


A . 




* 




15 


Arg 


504 


A 




• 






Arg 


505 






B 






Lys 


506 






B 






Ser 


507 






B 






Leu 


508 






B 




20 


Lys 


509 






B 






Ser 


510 












His 


511 








f 




Ser 


512 








■ 




Gly 


513 








T 


25 


Arg 


514 








T 




Met 


515 












Arg 


516 












Pro 


517 












Ser 


518 


A 








30 


Ala 


519 


A 










Glu 


520 


A 


A 








Gin 


521 


A 


A 








Lys 


522 


A 


A 








Arg 


523 


A 


A 






35 


Ala 


524 




A 




T 




Gly 


525 








T 




Arg 


526 








T 




Ser 


527 












Leu 


528 










40 


Pro 


529 






B 






Ter 


530 








T 
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1.94 


-1.16 




* 


F 


1.58 2,65 






1.43 


-1.59 


* 


* 


F 


1.24 5.24 






1.30 


-137 


* 




F 


0.90 2.69 






1.43 . 


-1.43 


* 


* 


F 


0.90 1.33 






1.71 


•1.00 


• 


* 


F 


0.90 1.03 






0.81 


-0.50 


* 


* 


F 


0.75 0.70 


T 




0.88 


0.14 


* 


* 


F 


0.25 0.52 


T 




0.83 


-036 


* 


* 


F 


0.85 0.52 


T 




1.08 


0.04 


# 


* 


F 


0.74 1.20 


T 




1.01 


-0.17 


* 


* 


F 


1.68 1.20 


T 




1.98 


-0.60 


* 


* 


F 


232 1.55 


T 




2.43 


-1.03 


• 




F 


2.66 2^4 


T 




2.69 


-1.53 


* 




F 


3.40 5.90 


T . 




2.42 


-2.17 






F 


2.66 6.03 






2.47 


-1.79 






F 


2.12 4.66 






2.40 


-1.50 






F 


1.78 3.89 






2.10 


-1.93 






F 


1.44 3.89 






2.41 


-1.54 






F 


1.44 2.66 






2.07 


-1.04 






F 


1.78 1.85 






1.61 


-0.66 






F 


2.12 1.24 




C 


1.61 


-0.23 


* 


* 


F 


2.21 0.91 


T 




0.97 


-0.61 


* 


* 


F 


3.40 2.17 


T 


C 


138 


-0.69 




* 


F 


2.86 1.07 


T 




1.98 


-0.69 




* 


F 


3.02 1.57 


T 




1.63 


-0.64 




* 


F 


2.98 1.78 




C 


1.34 


-0.76 




* 


F 


2.54 1.78 


T 


C 


138 


-0.64 




* 


F 


2.70 1.82 


T 


C 


1.68 


-1.07 




* 


F 


3.00 1.61 


T 




2.07 


-0.67 




* 


F 


2.50 2.82 


T 




2.07 


-1.29 




* 


F 


2.20 2.88 






2.08 


-1.29 


* 


* 


F 


1.50 3.65 






1.62 


-1.21 


* 


* 


F 


1.51 2.75 






1.94 


-1.17 


* 




F 


1.52 2.69 






1.94 


-1.67 


* 




F 


1.83 3.04 






1.72 


-1.29 


* 




F 


2.54 236 


T 




1.51 


-1.00 


* 




F 


3.10 0.97 


T 




1.12 


-0.57 


• * 




F 


2.79 0.77 


T 


C 


0.69 


-0.14 


* 




F 


1.98 0.97 


T 


c 


0.19 


-0.21 


* 






1.67 1.25 






039 


-0.21 


* 






0.81 0.82 






0.34 


0.21 


* 






030 0.78 
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Table II 



Res Position 


I 


II 


in 


IV V 


VI 


VII 


VIII 


IX 


X 


XI 


XII 


xm xiv 


Met 


1 












C 


0.69 


-0.24 




* 




1.19 1.74 


Arg 


2 












C 


038 


-0.24 




* 




1.53 1.34 


Pro 


3 










T 


C 


0.88 


0.11 








132 0.91 


Pro 


4 








T 


T 




\21 


-031 








2.61 1.80 


Gly 


5 








T 


T 




0.96 


-0.53 






F 


3.40 1.48 


Phe 


6 








T 


T 




0.74 


0.26 






F 


2.01 0.83 


Arg 


7 




A 


B 








-0.18 


0.51 








0.42 0.44 


Asn 


8 




A 


B 








-0.78 


0,77 


* 






0.08 037 


Phe 


9 . 




A 


B 








-1.16 


1.03 








-0.26 035 


Leu 


10 




A 


B 








-Ml 


0.74 








-0.60 0.18 


Leu 


11 




A 








C 


-0.71 


1.13 








-0.40 0.15 


Leu 


12 




A 








C 


-1.63 


Ml 


* 






-0.40 0.23 


Ala 


13 




A 








C 


-2.44 


1.01 








-0.40 0.23 


Ser 


14 




A 








C 


-2.44 


1.01 








-0.40 0.23 


Ser 


15 




A 








C 


-2.22 


1.11 








-0.40 0.24 


Leu 


16 




A 


B 








-1.76 


0.93 








-0.60 074 


Leu 


17 




A 


B 








-1.76 


0.86 








-0.60 0.18 


Phe 


18 




A 


B 








-1.47 


1.16 








-0.60 0,11 


Ala 


19 




A 








C 


-1.76 


1.16 








-0.40 0.18 


Gly 


20 




A 








C 


-2.31 


0.97 








-0.40 022 


Leu 


21 




A 








C 


-1.71 


0.93 


* 






-0.40 0.19 


Ser 


22 




A 








C 


-0.90 


0.57 


* 






-0.40 029 


Ala 


23 




A 








c 


-0.50 


0.47 


* 






-0.40 0.51 


Val 


24 




A 








c 


-0.61 


0.43 


* 






-0.40 0.83 


Pro 


25 










T 


c 


-0.57 


0.53 


« 




F 


0.15 0.53 


Gin 


26 








T 


T 




0.03 


0.53 


* 




F 


035 0.71 


Ser 


27 








T 


T 




0.03 


0.46 


* 




F 


0.50 1.48 


Phe 


28 










T 


c 


-0.19 


0.20 


* 


* 


F 


0.60 1.28 


Ser 


29 










T 


c 


0.78 


0.46 


* 


* 


F 


0.15 0.61 


Pro 


30 










T 


c 


0.69 


0.06 


* 


* 


F 


0.450.89 


Ser 


31 








T 


T 




0.40 


0.06 


» 


* 


F 


0.80 1.38 


Leu 


32 








T 


T 




0.49 


0.19 


* 


* 


F 


0.80 1.08 


Arg 


33 








T 






0.84 


0.23 


* 


* 


F 


0.60 1.08 


Ser 


34 








T 






0.56 


0.23 


* 


* 


F 


0.45 0.80 


Trp 


35 










T 


c 


0.18 


0.34 


* 


* 


F 


0.45 0.98 


Pro 


36 










T 


c 


-0.19 


0.16 




* 


F 


0.45 0.50 


Gly 


37 








T 


T 




0.73 


0.73 


* 


* 




0.20 0.20 
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/Via 


38 


* 






T 


T 




-0.19 


034 


* 


* 




0.50 0.38 


Ala 


39 


* 


• 




* 




C 


■0.19 


0.11 


* 


• 




0.10 0.20 


Cys 


40 




B 




T 




• 


0.21 


0.07 


* 


■ 




0.30 0.27 


Arg 


41 


A 


B 








• 


-0.17 


-0.36 


* 


• 




0.30 0.53 


Leu 


42 


A 










C 


0.18 


-036 


* 


• 




0.50 0.53 


Ser 


43 


A 










c 


0.47 


-0.86 


* 


* 




0.95 1.70 


Arg 


44 


A 










c 


1.06 


-1.04 


* 




F 


1.10 1.16 


Ala 


45 


A 










c 


1.83 


•1.04 




* 


F 


1.10 2.44 


GIu 


46 


A 






T 






1.83 


-1.73 




* 


F 


1.30 3.57 


Ser 


47 


A 






T 






1.98 


-2.11 






F 


1.30 3.57 


Glu 


48 


A 






T 






2.39 


-1.54 






F 


1.30 1.89 


Arg 


49 


A 






T 






1.69 


-2.04 




* 


F 


1.30 2.14 


Arg 


50 


A 






T 






2.07 


-1.54 


* 




F 


1.58 1.62 


Cys 


51 


A 






T 






1.72 


-1.50 


• 


* 




1.71 1.44 


Arg 


52 


A 






T 






2.02 


-1.07 


* 


* 


F 


1.99 0.73 


Ala 


53 








* 


T 


c 


1.81 


-0.67 


* 


* 


F 


2.47 0.64 


Pro 


54 








T 


T 




1.49 


-0.24 




* 


F 


2.80 1.86 


Gly 


55 








T 


T 




1.03 


-039 




* 


F 


2.52 1.47 


Gin 


56 










T 


c 


1.11 


0.04 


* 


* 


F 


1.44 1.44 


Pro 


57 








• 


T 


c 


0.41 


0.04 




* 


F 


1.01 0.94 


Pro 


58 








T 


T 




0.19 


0.11 






F 


0.93 0.96 


Gly 


59 








T 


T 




-0.27 


0.37 






F 


0.65 0.46 


Ala 


60 




B 




* 


T 




0.04 


0.54 








-0.20 0.16 


Ala 


61 




B 




- 






-0.30 


0.61 




* 




-0.40 0.14 


Leu 


62 




B 




- 


* 




0.02 


0.61 




* 




-0.40 0.14 


Cys 


63 








T 






-0.11 


0.19 




* 




0.61 0.27 


His 


64 








T 


T 




0.34 


O.U 




* 




1.12 0.26 


Gly 


65 








T 


T 




0.27 


-039 




* 


F 


2.18 0.63 


Arg 


66 








T* 


T 




0.86 


-0.50 




* 


F 


2.49 0.63 


Gly 


67 








T 


T 




1.00 


-1.07 




* 


F 


3.10 0.77 


Arg 


68 








T 


* 




132 


-1.00 




* 


F 


2.59 0.42 


Cys 


69 








T 


T 




0.50 


-1.00 




* 




2.33 0.21 


Asp 


70 








T 


T 




0.18 


-0.36 




* 




1.72 0.16 


Cys 


71 








T 


T 




-0.82 


-0.21 








1.41 0.04 


Gly 


72 








T 


T 




-1.14 


0.47 








0.200.06 


Val 


73 






B 


T 






-1.29 


0.47 








-0.20 0.02 


Cys 


74 




B 


B 








-1.48 


0.97 








-0.60 0.05 


He 


75 




B 


B 








-1.79 


1.04 








-0.60 0.03 


Cys 


76 




B 


B 








-1.12 


1.10 








-0.60 0.07 


His 


77 




B 


B 








-0.99 


0.46 








-0.60 0.22 


Val 


78 






B 


T 






-0.48 


0.31 








0.10 0.48 
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Thr 


79 




B 




Glu 


80 




• 




Pro 


81 




T 




Giy 


82 




T 


5 


Met 


83 




T 




Phe 


84 




T 




Phe 


85 




T 




Gly 


86 




• 




Pro 


87 




• 


10 


Leu 


88 




T 




Cys 


89 




T 




Glu 


90 


A 


T 




Cys 


91 


A 


T 




His 


92 


A 


T 


15 


Glu 


93 


A 


T 




Trp 


94 


A 


T 




Val 


95 


A 


T 




Cys 


96 


A 


T 




Glu 


97 


A 


T 


20 


Thr 


98 


A 


T 




Tyr 


99 




T 




Asp 


100 




T 




Gly 


101 




T 




Ser 


102 




T 


25 


Thr 


103 




T 




Cys 


104 




T 




Ala 


105 




T 




Gly 


106 




T 




His 


107 




T 


30 


Gly 


108 




T 




Lys 


109 




T 




Cys 


110 




T 




Asp 


111 




T 




Cys 


112 




T 


35 


Gly 


113 




T 




Lys 


114 




T 




Cys 


115 




T 




Lys 


116 




T 




Cys 


117 




T 


40 


Asp 


118 




T 




Gin 


119 




T 
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C 


-0.41 


0.06 . . 


F 


0.05 0.89 


T 


C 


-012 


0.10 * . 


F 


0.45 0.64 


T 




-016 


039 . . 


F 


0.65 0.75 


T 




-0.57 


033 . . 




0.20 0.45 


T 




0.08 


0.47 * . 




010 0.26 


* 




-0.42 


0.90 . . 




0.00 016 


• 




-1.09 


1.16 . . 
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T 




2,09 




T 


- 




2.03 






T 




1.71 




T 


T 




136 




T 


t 




050 




T 


T 




0.87 


B 


T 






0.28 


B 






C 


0.31 


B 






C 


0.52 


B 






c 


0.22 










-0.37 










0.41 










0.77 




T 






0.91 




T 






1.21 



-1.07 


* * 


F 


3.062.28 


-0.50 


* 


F 


2.07 0.62 


-0.11 


* 


F 


1.73 0.53 


-0.44 


* * 




1.24 0.39 


-0.33 






1.10032 


0.10 






0.50 0.19 


0.50 


» 


F 


0.35 038 


0.29 




F 


0.87 038 


0.11 


» 


F 


0.89 0.46 


0.30 


* * 




0.96 0.25 


0.30 


* * 




138 0.18 


-0.20 






2.20 036 


r0.06 






1.98 0.11 


-0.46 


* * 




2.04 0.34 


-0.46 


* * 




1.90 0.11 


-0.86 


* * 




2.46 0.43 


-0.93 


* * 




2.52 0.44 


-0.93 


* * 




2.80 0.22 


-0.01 


* * 




2.22 0.43 


0.17 


* * 




1.14 0.40 


0.60 


* 




0.56 0.75 


0.03 


* 




0.92 0.75 


0.03 


* 




1.13 1.48 


0.10 


* 


F 


1.67 0.80 


-0.33 


* 


F 


2.76132 


-0.74 




F 


3.401.15 


-1.07 




F 


3.06 1.79 


-1.10 




F 


252 155 


-0.71 




F 


2.18 1.04 


-0.14 




F 


1.74 1.04 


-0.23 




F 


1.25 0.87 


0.46 




F 


035 052 


0.34 




F 


0.25 0.45 


0.41 






-0.40 0.27 


0.01 






-0.10 0.34 


-037 






0.65 1.11 


-0.56 


* * 


F 


0.90 1.03 


-036 


* * 


F 


0.45 0.87 


-1.04 


* * 


F 


0.90138 


-1.43 


* * 


F 


1.64 253 


-1.36 


* * 


F 


1.83 0.78 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



227 



Arg 325 . A . . T 

Lys 326 . A . . T 

Cys 327 . . . . T 

Gin 328 .... T 

5 Gly 329 .... T 

Ser 330 .... T 

Scr 331 .... T 

Asp 332 .... T 

Leu 333 

10 Pro 334 . T 

Cys 335 .... T 

Ser 336 .... T 

Gly 337 . . . T 

Arg 338 . . . T 

15 Gly 339 .... T 

Lys 340 .... T 

Cys 341 .... T 

Glu 342 . . . T 

Cys 343 .... T 

20 Gly 344 .... T 

Lys 345 . . T 

Cys 346 .... T 

Thr 347 .... T 

Cys 348 .... T 

25 Tyr 349 B 

Pro 350 

Pro 351 .... T 

Gly 352 .... T 

Asp 353 .T* 

30 Arg 354 . . T 

Arg 355 .... T 

Val 356 .... T 

Tyr 357 .... T 

Gly 358 .... T 

35 Lys 359 . . . T 

Thr 360 .... T 

Cys 361 .... T 

Glu 362 .... T 

Cys 363 .... T 

40 Asp 364 .... T 

Asp 365 .... T 



• 


• 


1.46 


-0.96 


* 


• 


F 


2.17 0.78 






1.16 


-0.53 


* 




F 


2.51 0.58 


T 




0.86 


-0.53 


* 




F 


3.401.11 


T 




1.16 


-0.83 


* 


* 


F 


Z91 0.76 


T 




123 


-0.83 


* 


* 


F 


2.570.63 


T 


- 


0.91 


-0.14 


* 


* 


F 


1.93 0.97 






0.20 


-0.29 




* 


F 


1.39 0.87 






0.57 


-0.11 


* 


+ 


F 


1.05 0.47 




C 


0.22 


-0.16 


* 


* 


F 


1.16 0.47 




. 


0.68 


-0.11 


* 


* 


F 


1.67 0.35 


T 




0.63 


-0.50 




* 


F 


2.18 0.41 


T 




0.98 


-0.07 




* 


F 


2.49 0.49 


T 




0.31 


-0.76 




* 


F 


3.10 0.63 


T 




1.12 


-0.61 




* 


F 


2.79 0.63 


. 




0.67 


-1.19 




* 


F 


2.56 0.82 






0.99 


-1.00 


* 


* 


F 


2.53 0.44 


T 




1.33 


-1.00 


* 


* 


F 


2.70 0.22 


T 




L01 


-1.00 


* 


* 




2.52 0.45 


T 




0.59 


-0.86 




* 




2.80 0.12 


T 




0.27 


-0.37 








2.22 033 






-0.02 


-037 








1.74 0.10 






0.43 


039 








0.86 0.29 






0.22 


0.24 








0.58 0.46 






0.54 


0.24 




* 




0.64 036 






0.89 


0.67 




* 




0.28 0.66 


T 


C 


0.96 


0.10 






F 


1.47 0.76 


T 




1.73 


-039 






F 


2.76 2.78 


T 




1.19 


-0.96 






F 


3.40 3.47 


T 




1.61 


-1.07 






F 


3.06 1.67 






1.51 


-0.74 




* 


F 


2.52 1.69 






1.77 


-0.74 




* 


F 


2.18 1.69 






1.67 


-1.17 




* 




1.69 102 






134 


-0.69 




* 




135 1.49 


T 




134 


-0.11 




* 


F 


1.25 0.41 


T 




0.57 


-0.11 






F 


1.25 0.95 


T 




0.46 


-0.19 




* 


F 


1.59 0.33 


T 




131 


-0.94 




* 


F 


2.23 0.55 






1.67 


-137 








2.22 0.46 


T 




2.12 


-137 








2.76 0.62 


T 




1.41 


-1.86 






F 


3.40 2.28 


T 




1.72 


-1.86 






F 


2.91 0.70 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



228 



PCT/US99/25031 



10 



15 



20 



25 



30 



35 



40 



Arg 

Arg 

Cys 

Glu 

Asp 

Leu 

Asp 

Gly 

Val 

Val 

Cys 

Gly " 

Gly 

His 

Gly 

Thr 

Cys 

Ser 

Cys 

Gly 

Arg 

Cys 

Val 

Cys 

Glu 

Arg 

Gly 
Tip 
Phe 
Gly 
Lys 
Leu 
Cys 
Gin 
His 
Pro 
Arg 
Lys 
Cys 
Asn 
Met 



366 

367 

368 

369 

370 

371 

372 

373 

374 

375 

376 

377 

378 

379 

380 

381 

382 

383 

384 

385 

386 

387 

388 

389 

390 

391 

392 

393 

394 

395 

396 

397 

398 

399 

400 

401 

402 

403 

404 

405 

406 



B 
B 



B 
B 
B 



A 
A 



T 


T 


• 


239 


-1.86 




F 


3.03 22% 


T 




• 


1.58 


-X43 




F 


180 228 


T 




• 


2.24 


-1.74 




F 


2.771.12 


T 




* 


1.90 


-1.74 




F 


2.59 0.96 


T 


T 


• 


1.04 


-131 




F 


3.10 0.48 


T 


T 


* 


0.08 


-0.67 




F 


2.79 0.67 


T 


T 




-0.70 


-0.(50 




F 


2.48 0.29 


T 


T 


• 


-038 


-0.03 






1.72 0.09 






• 


-0.72 


0.40 






0.21 0.11 






• 


-0.76 


0.14 






-0.10 0.07 


T 


T 


- 


-0.29 


0.64 






0.20 0.09 


T 


T 




-0.60 


0.64 






0.20 0.12 


T 


T 


• 


-0.92 


0.49 




F 


0.35 0.23 


T 


T 




-037 


0.41 




F 


035 0.23 


T 






-0.18 


0.23 




F 


0.45 032 


T 




• 


0.14 


037 




F 


0.45 0.17 


T 


T 


• 


0.60 


037 






050 0.12 


T 


T 


- 


028 


-0.13 






1.28 0.25 


T 


T 


- 


-0.54 


0.01 






0.86 0.09 


T 


T 




-0.87 


0.17 






1.04 0.13 


T 


* 


- 


-0.56 


0.17 






1.02 0.05 


T 




• 


0.22 


-0.21 ' 






1.80 0.16 








0.18 


-0.79 * 






152 0.32 






• 


0.56 


-0.79 < 


* * 




1.34 0.16 


T 


T 




020 


0.13 * 






0.86 032 


T 


T 




-0.26 


0.34 






0.68 038 


T 


T 


• 


0.46 


0.13 . 






0500.69 


T 


T 


• 


050 


-0.44 






1.10 0.80 


T- 




• 


0.50 


0.24 * 






0.30 0.34 


T 






050 


0.81 « 






0.00 0.18 


T 




• 


0.36 


0.79 « 






0.00 030 


T 




- 


0.49 


037 * 


* 




0.64 0.47 


T 






0.89 


0.01 * 


* 




0.98 0.74 


T 






1.63 


-0.41 * 






1.92 0.72 




T 


C 


1.31 


-0.41 * 


* 




2.41 1.75 


T 


T 




1.27 


-0.53 * 


* 


F 


3.40 1.75 


T 


T 




1.48 


-0.70 * 




F 


3.06 1.63 


T 


T 




1.83 


-0.49 




F 


2.42 1.18 


T 






1.83 


-050 ♦ 






2.03 1.11 






C 


1.87 


-0.93 






1.14 0.98 






c 


2.08 


-0.93 * 




F 


0.95 0.85 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



229 





Thr 


407 




A 






* 




Glu 


408 


A 


A 






* 




Glu 


409 


A 


A 






• 




Gin 


410 




A 






T 


5 


Ser 


411 




* 






T 




Lys 


412 










T 




Asn 


413 










• 




Leu 


414 














Cys 


415 










T 


10 


Glu 


416 










T 




Ser 


417 










T 




Ala 


418 










T 




Asp 


419 










T 




Gly 


420 










T 


15 


He 


421 






B 








Leu 


422 






B 








Cys 


423 










T 




Ser 


424 










T 




Gly 


425 










T 


20 


Lys 


426 










T 




Gly 


427 










T 




Ser 


428 










T 




Cys 


429 










T 




His 


430 










T 


25 


Cys 


431 










T 




Gly 


432 










T 




Lys 


433 










T 




Cys 


434 










T 




lie 


435 




A 






T 




Cys 


436 




A 


B 








Ser 


437 




A 






* 




Ala 


438 




A 






* 




Glu 


439 




A 






T 




Glu 


440 




A 




B 


T 


35 


Trp 


441 




A 




B 


T 




Tyr 


442 




A 




B 


T 




He 


443 








B 


T 




Ser 


444 








B 


T 




Gly 


445 










T 


40 


Glu 


446 










T 




Phe 


447 










T 



* 


C 


1.67 


-0.53 




F 1.44 2.73 


* 




1.67 


-0.71 


* 


F 1.58 2.28 


• 




2.33 


-1.11 


* 


F 1.92 4.61 


■ 




1.52 


-133 


* • 


F 2.66 5.13 


T 




1.46 


-1.13 


* . 


F 3.402.44 


T 




1.77 


-036 


* . 


F 2.91 0.76 


T 


C 


1.47 


-0.56 


* • 


F Z62 0.76 


T 


C 


0.88 


-0.57 


* • 


. 238 0.76 


• 




0.88 


-0.46 


* . 


1.99 038 


• 




0.83 


-0.46 




F 2.05 0.40 


T 




-0.10 


-0.43 


* - 


F 2,50 0.48 


T 




-0.91 


-0.43 


* - 


F 2.25 0.62 


T 




-0.77 


-0.31 


* . 


F 2.00 030 


T 




-0.40 


0.26 


* - 


1.00 0.12 






-0.74 


0.26 


* - 


0.15 0.16 






-0.40 


0.19 


* * 


0.15 0.09 


T 




-0.16 


0.19 


* * 


1.00 0.19 


T 




-0.46 


0.19 


* * 


F 1.40 0.27 


T 




-0.78 


-0.11 


* 


F 2.25 0.43 


T 




0.08 


-0.23 


• 


F 230 0.43 






0.22 


-0.30 


* 


F 2.05 0.44 


• 




0.54 


-0.11 


* * 


F 1.80 0.24 


* 




0.89 


-0.11 


* * 


1.40 0.12 


T 




0.57 


-0.11 




135 0.24 


T 




-0.37 


0.03 


* • 


0.50 0.10 


T 




-069 


0.33 


* • 


0.500.12 


T 




-0.69 


033 


* . 


0.50 0.05 






-0.61 


0.21 


* - 


0.30 0.12 






-0.58 


0.14 


* . 


0.10 0.13 






0.09 


-0.29 


* . 


030 0.11 




c 


0.14 


-0.29 


* 


. 030 035. 




c 


-0.14 


0.06 


* • 


. -0.10 032 






-0.37 


0.13 


• - 


. 0.25 1.53 






0.22 


0.24 




0.10 0.80 






0.54 


0.24 


* 


. 0.25 1.06 






0.84 


0.17 


* 


0.10 0.61 






0.73 


0.17 


♦ 


0.10 0.61 






0.07 


0.96 


* 


-0.20 030 






0.07 


0.61 


• 


F 0.15 0.17 






-0.31 


-0.14 


* 


0.90 0.41 






-0.07 


-0.26 


* 


1.24 0.16 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



Cys 


448 






T 


Asp 


449 






T 


Cys 


450 






T 


Asp 


451 






T 


Asp 


452 






T 


Arg 


453 






T 


Asp 


454 






T 


Cys 


455 






T 


Asp 


456 






T 


Lys 


457 






T 


His 


458 






T 


Asp 


459 






T 


Gly 


460 






T 


Leu 


461 




B 


• 


lie 


462 




B 




Cys 


463 






T 


Thr 


464 






T 


Gly 


465 






T 


Asn 


466 






T 


Gly 


467 






T 


lie 


468 






T 


Cys 


. 469 






T 


Ser 


470 






T 


Cys 


471 






T 


Gly 


472 






T 


Asn 


473 






T 


Cys 


474 






T 


GIu 


475 






T 


Cys 


476 






T 


Tip 


477 






T 


Asp 


478 






T 


Gly 


479 






T 


Trp 


480 






T 


Asn 


481 








Gly 


482 








Asn 


483 








Ala 


484 








Cys 


485 


A 




T 


Glu 


486 


A 


B 




lie 


487 


A 




T 


Trp 


488 


A 




T 



230 



T 




0.82 


-0.64 


* * 




2.08 0.28 


T 




1.23 


-1.07 


* 




2.42 0.27 


T 




1.58 


-1.07 


* 


. 


2.76 0.60 


T 




0.91 


-1.86 


* 


F 


3.40 1.87 


T 




1.61 


-1.86 


* 


F 


2,91 0.60 


T 




132 


-1.86 


* * 


F 


2.72 1.87 


T 




2,29 


-Z43 


* * 


F 


2.72 2.24 


T 




Z96 


-1.93 




F 


2.72 1.83 






2.61 


-1.93 




F 


2.52 1.56 






1.80 


-1.50 


* 


F 


2.71 0.92 


T 




0.80 


-0.81 


* 


F 


3.40 1.42 


T 




0.13 


-0.70 




F 


2.91 0.60 


T 




0.49 


-0.13 






2.12 0.16 


T 




0.14 


0.36 






0.78 0.17 






0.10 


0.29 






0.24 0.10 


T 




-0.21 


0.69 


* 


, 


0.20 0.16 


T 




-1.10 


0.69 


* 


F 


0.35 0.20 


T 




-1.42 


0.69 


* 


F 


0.35 0.20 


T 




-0.91 


0.57 


* 


F 


0.35 0.20 


. 




-0.69 


039 




F 


0.45 0:18 


. 




-0.37 


0.47 


* 




0.00 o.io- 


T 




-0.06 


0.47 


* 




0.42 0.06 


T 




-0.38 


0.47 


* 




0.64 0.10 


T 




-0.38 


0.61 






0.86 0.08 


T 




-0.70 


-0.07 






1.98 024 


T 




-0.10 


-0.07 






2.20 0.10 


T 




0.57 


0.46 






1.08 0.19 


T 




0.52 


-0.11 






1.76 032 


T 




0.90 


-0.11 






1.54 0.20 


T 




1.24 


0.40 






0.42 0.39 


T 




0.90 


023 






0.50 0.36 


T 




1.57 


0.66 




F 


0.35 0.67 . 


T 




0.98 


0.49 




F 


0.501.02 


T 


C 


0.98 


0.07 




F 


0.45 0.62 


T 


C 


1.27 


0.64 


* 


F 


0.15 0.33 


T 


C 


0.38 


0.21 


• 




0.30 0.55 - 


T 


C 


0.43 


-0.01 






0.90 0.24 






-0.09 


0.50 






-0.20 0.25 






-0.43 


0.76 






-0.60 0.13 






-0.39 


0.79 






-0.20 0.13 






-0.39 


0.67 






-0.20 0.32 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



231 



Leu 
Gly 
Ser 
Glu 
Tyr 
Pro 
Ter 



489 
490 
491 
492 
493 
494 
495 



T 
T 



T 
T 
T 
T 



C 
C 
C 



-0.04 0.10 
0.41 0.86 



0.02 
0.52 
0.42 
0.84 
0.80 



0.60 
0.1 1 
-0.14 
-0.14 
-0.10 



F 
F 
F 



-0.10 0.32 
0.56 0.72 
0.72 1.05 
1.23 1.63 
1.89 2.11 
2.10 2.01 
1.89 1.48 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



232 





Res Position 


I 


11 


5 


Met 


1 


A 


A 




Glu 


2 - 


A 


A 




Thr 


3 


A 


A 




Gly 


4 


A 


A 




Ala 


5 


A 


A 


10 


Leu 


6 


A 


A 




Arg 


7 




A 




Arg 


8 








Pro 


9 




A 




Gin 


10 




A 


15 


Leu 


11 




A 




Leu 


12 




A 




Pro 


13 




A 




Leu 


14 




A 




Leu 


15 




A 


20 


Leu 


16 




A 




Leu 


17 




A 




Leu 


18 




A 




Cys 


19 








Gly 


20 






25 


Gly 


21 








Cys 


22 








Pro 


23 








Arg 


24 








Ala 


25 






30 


Gly 


26 








Gly 


27 








Cys 


28 








Asn 


29 








Glu 


30 






35 


Thr 


31 








Gly 


32 








Met 


33 


A 


A 




Leu 


34 


A 


A 




Glu 


35 


A 


A 


40 


Arg 


36 


A 


A 



Table HI 



III 


IV V 


VI 


VII VIII 


• 






. 0.10 


• 






-0.32 








. 0.18 








0.68 


* 






. 0.86 








. 1.46 


B 






. 0.64 


B 






. 0.14 


B 






0.28 


B 
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0.70 








-0.23 0.88 


Ser 


AJj 






o 

D 




T 




-0.99 


0.60 








0.14 0.88 


Asp 


256 








T 


T 




-0.29 


0.60 








0.71 0.46 




9V7 
ZJ / 








T 


T 




0.22 


-0.16 






F 


1.93 0.94 


Leu 


95R 






B 




T 




-0.42 


-0.46 






F 


1.70 0.94 


I /Ml 








B 




* 




0.18 


-0.16 


* 




F 


133 039 


Asp 


ZOU 






B 




* 




0.13 


0.27 


* 




F 


0.56 038 


oer 


ZOl 






B 




• 




-0.08 


030 


* 




F 


039 0.45 


ue 


zoz 






B 




T 




-0.89 


0.04 


* 




F 


0.42 0.85 


no 


zto 






B 




T 




-0.29 


0.04 


* 




F 


0.25 0.42 


Uly 


264 










T 


C 


0.31 


0.47 


* 




F 


0.15 0.48 


Prrv 

rro 


ZOO 










T 


C 


0.01 


031 




* 


F 


030 1.07 


I Mil 


ZOO 












C 


-030 


021 


* 


* 


F 


0.42 0.93 


ITO 


Z07 










T 


c 


0.50 


0.47 


* 


* 


F 


0.49 0.77 


no 


ZOo 








T 


T 




0.41 


0.04 


* 


* 


F 


1.16 0.98 




zo? 






B 




T 




-0.10 


0.00 






F 


1.68 139 


Leu 


270 






n 
O 




T 




0.08 


-0.04 




* 


F 


1.70 0.76 




971 






n 
O 




* 




0.08 


0.03 




* 


F 


0.73 0.67 


Ser 


272 






n 
B 








0.29 


0.29 


- 






0.41 0.41 


Val 


97^ 






D 
O 




* 




030 


030 


• 


* 




0.24 0.87 


His 


274 






n 
Z> 




* 




0.80 


0.01 


- 


* 




0.07 0.71 


Leu 


975 

Z/J 






B 




T 




0.80 


0,41 


* 






-0.20 0.86 


Gin 


276 










T 


c 


-0.20 


0.71 


* 




F 


0.15 0.95 


Asn 


277 










T 


c 


0.10 


0.76 




* 


F 


0.15 0.49 


Asn 


278 










T 


c 


0.64 


0.26 


* 


* 


F 


0.60 1.03 


Leu 


279 


A 


A 










0.08 


0.06 


* 






-030 0.86 


He 


280 


A 


A 










0.89 


027 


* 


* 




-030 0.53 


Glu 


281 




A 


B 








1.00 


0.27 








-030 0.57 


Thr 


282 


A 


A 










1.00 


-0.13 








0.45 135 
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Met 


283 




A 


B 




Gin 


284 




A 


B 




Arg 


285 




A 


B 




Asp 


286 




A 


. 


5 


Val 


287 




A 


B 




Phe 


288 


A 


A 






cys 


289 


A 


A 






Asp 


290 


A 


A 






Pro 


291 


A 


A 




10 


Glu 


292 


A 


A 






Glu 


293 


A 


A 






His 


294 


A 


A 






Lys 


295 


A 


A 






His 


296 


A 


A 




15 


Thr 


297 


A 


A 






Arg 


298 


A 


A 






Arg 


299 


A 


A 






Gin 


300 


A 


A 






Leu 


301 


A 


A 




20 


Glu 


302 




A 


B 




Asp 


303 




A 


B 




lie 


304 




A 


B 




Arg 


305 




A 


B 




Leu 


306 




A 


B 


25 


Asp 


307 










Gly 


308 










Asn 


309 










Pro 


310 






. 




He 


311 






B 


30 


Asn 


312 






B 




Leu 


313 






B 




Ser 


314 






B 




Leu 


315 






B 




Phe 


316 






B 


35 


Pro 


317 










Ser 


318 










Ala 


319 






B 




Tyr 


320 






B 




Phe 


321 






B 


40 


Cys 


322 






B 




Leu 


323 






B 



T 
T 







0.14 


-0.81 






F 


0.90 3.21 






0.26 


-0.86 






F 


0.90 1.38 






0.48 


-0.07 






F 


0.45 0.83 






0.48 


0.01 








0.100.45 






0.58 


-0.60 








0.60 0.43 






1.18 


-0.57 








0.600.34 






1.18 


-0.57 








0.600.35 






1.03 


-0.57 






F 


0.75 0.82 






1.08 


-0.71 






F 


0.90 1.30 






1.90 


-1.50 






F 


0.90 4.83 






2.29 


-1.57 






F 


0.90 3.94 






3.07 


-1.09 






F 


0.90 3.67 






3.18 


-1.51 






F 


0.904.16 






339 


-1.51 


* 




F 


0.90 4.70 






2.58 


-1.11 


* 




F 


0.90 5.98 






2.58 


-0.93 


* 




F 


0.90 2.47 






2.61 


-0.93 


* 




F 


0.903.14 






1.68 


-1.43 


* 


* 


F 


0.90 3.63 






1.82 


-1.23 


* 


* 


F 


0.90 1.30 






132 


-13 


* 


* 


F 


0.90 130 






1.21 


-034 


* 


* 


F 


0.75 0.62 






0.76 


-0.94 


* 


* 


F 


Ml 1.25 






0.76 


-1.20 




* 


F 


1.17 0.72 






1.36 


-0.80 


* 


* 


F 


138 0.69 


T 




0.47 


-0.37 


* 




F 


224 152 


T 


C 


0.47 


-037 




* 


F 


2.10 0.54 


T 


C 


0.54 


0.03 


* 


* 


F 


1.44 1.06 


T 


C 


0.13 


0.03 


* 


* 


F 


1.08 0.52 






0.13 


0.41 






F 


0.17 0.71 






-0.57 


0.67 








-0.19 036 






-0.43 


1.06 




* 




-0.40 0.20 






-0.73 


1.06 




* 




-0.40 0.45 






-1.11 


0.76 




* 




-0.40 0.37 


T 




-0.47 


0.86 




* 




-0.20 0.46 


T 




-1.17 


0.93 








0.200.54 


T 




-1.02 


133 




* 




0.200.56 


T 




-1.53 


1.21 








-0.20 035 






-0.93 


1.11 


* 






-0.40 0.19 






-0.12 


1.11 


* 






-0.40 0.21 






-0.72 


0.73 


* 






-0.40 0.42 






-0.63 


0.91 




* 




-0.40 0.22 
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5 



10 



Pro 


324 


B 






-0.93 


0.59 


* 




-0.40 0.39 


Arg 


325 


B 


B 




-1.03 


0.49 






-060 0 si 


Leu 


326 


B 


B 




-022 


0.34 






4.30 0.61 


Pro 


327 




B 


T 


-026 


-0.34 


» 




0.70 0.78 


lie 


328 




B 


T 


. 024 


0.01 






0.10 0.34 


Gly 


329 


B 


B 




. 0.07 


0.50 


* 


F 


-0.60 0.60 


Arg 


330 


B 


B 




-0.43 


0.24 


* 


F 


-0.15 0.50 


Phe 


331 


B 


B 




-0.01 


0.24 


* 




-0.30 0.91 


Thr 


332 


B 


B 




-0.19 


-0.01 


* * 




0.45 1.17 


Ter 


333 


B 


B 




. 0.31 


-0.01 


* * 




030 0.76 
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Table V 



Res Position 


I 


11 


III 


IV 


V 


VI 


VII 


VIII 


IX 


X 


XI 


XII 


XIII XIV 


Met 


1 




A 


B 










•130 


0.7Q 








•v.uv VO" 


Leu 


2 




A 


B 










-1.72 


0.96 








A CC\ a ne 
-v.OU U.ZO 


Leu 


3 




A 


B 










-2.14 


1.21 








-V.OU v. lO 


Pro 


4 




A 


B 










-2.06 


1.47 








A (Lf\ A 1 A 


Leu 


5 




A 


B 










-1.97 


1.24 








A £(\ a *v» 
-U.OU U.ZZ 


Leu 


6 




A 


B 










-2.18 


0.94 








-v.ou UOO 


Leu 


7 




A 


B 










-2.18 


0.94 








A ce\ n to 
-v.OU U. 1 y 


Ser 


8 




A 


B 










-1.71 


1.20 








A £f\ A 1 A 

-U.OU 0. 1 y 


Ser 


9 






B 


B 








-1.84 


0.94 






p 




Leu 


10 






B 


B 








-1.33 


0.69 






p 


A AC A *V7 


Leu 


11 








B 






c 


-0.52 


0.39 






p 


ft A *57 
U.UJ u.z/ 


Gly 


12 












T 


c 


-0.30 


0.40 






p 
r 


a a< n i< 
U.Jj 


Gly 


13 












T 


C 


-0.60 


0.51 






p 




Ser 


14 






B 






T 




-0.30 


0.44 






p 

r 


0 19 ft S7 


Gin 


15 






B 






T 




0.17 


-0.24 




* 




1 tO ft ft» 

1.17 U.OO 


Ala 


16 






B 










1.09 


-0.24 




* 


p 


1 1 A ft OS 


Met 


17 






B 






T 




0.73 


-0.67 




* 


p 


1 OR 1 7ft 

1 .70 I .XO 


Asp 


18 






B 






T 




0.79 


-0.27 




* 


F 




Gly 


19 










T 


T 




0.20 


0.24 


* 


* 


p 


1 33 0.67 


Arg 


20 










T 


T 




0.31 


0.43 


* 


* 




fl 71 ft AH 

U. 1 J U.*f / 


Phe 


21 






B 


B 








0.04 


-0.19 


* 


* 




0.64 0.56 


Tip 


22 






B 


B 








0.64 


0.46 


* 






-0.43 0.42 


lie 


23 






B 


B 








0.64 


0.43 


* 


* 




-0.60 037 


Arg 


24 






B 


B 








0.69 


0.43 


* 


* 




-0.60 0.74 


Val 


25 






B 


B 








-0.28 


0.03 


* 


* 




-030 0.94 


Gin 


26 








B 






C 


-0.18 


-0.24 


* 


* 


F 


0.65 0.99 


GIu 


27 








B 






C 


-0.74 


-0.31 


* 


* 


F 


0.65 0.50 


Ser 


28 








B 


T 






-0.07 


0.33 




* 




0.100.50 


Val 


29 






B 


B 








-0.18 


0.1 1 




* 




-030 0.45 


Met 


30 






B 


B 








0.09 


-0.29 








030 0.45 


Val 


31 






B 


B 








-0.58 


0.21 








-030 0.34 


Pro 


32 






B 


B 








-0.58 


0.40 








-0.30 0.24 


Glu 


33 






B 










-1.17 


-0.24 








0.50 0.41 


Ala 


34 










T 






-0.61 


-0.17 








0.90 0.39 


Cys 


35 






B 










-0.87 


-0.43 


* 






0.50 0.34 


Asp 


36 








B 


T 






-0.22 


-0.21 


* 






0.70 0.14 
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10 



15 



20 



15 



10 



15 



0 



He 


37 


fi 


B 


• 




Ser 


38 


B 


B 


• 




Val 


39 


B 




. 


T 


Pro 


40 


B 






T 


Cys 


41 






T 


T 


Ser 


42 






T 


T 


Phe 


43 






T 




Ser 


44 


B 






, 


Tyr 


45 


B 




. 


T 


Pro 


46 






T 


T 


Arg 


47 






T 


T 


Gin 


48 






T 


T 


Asp 


49 






T 




Tip 


50 






T 


T 


Thr 


51 






. 


T 


Gly 


52 






T 


T 


Ser 


53 








T 


Thr 


54 










Pro 


55 






. 


T 


Ala 


56 






T 


T 


Tyr 


57 






T 


T 


Gly 


58 






T 


T 


Tyr 


59 


B 


B 






Trp 


60 


B 


B 






Phe 


61 


B 


B 






Lys 


62 


B 


B 






Ala 


63 


B 


B 






Val 


64 


B 


B 






Thr 


65 


B 


B 






Glu 


66 


B 


B 






Thr 


67 




B 


T 




Thr 


68 






T 


T 


Lys 


69 






T 


T 


Gly 


70 








T 


Ala 


71 








T 


Pro 


72 


B 








Val 


73 


B 








Ala 


74 


B 








Thr 


75 


B 








Asn 


76 










His 


77 








T 



c 
c 



c 
c 



-0.68 
-0.98 
-1.09 
-0.72 
-0.97 
-0.29 
0.12 
. 0.98 
1.19 
137 
136 
1.91 
1.91 
1.84 
1.84 
1.14 
0.90 
0.56 

0.60 

0.62 

0.27 

0.61 

033 

-031 

-0.03 

0.21 

0.24 

0.18 

0.51 

0.87 

023 

0.61 

0.61 

033 

0.02 

033 

0.61 

0.57 

0.61 

131 

1.52 



021 

029 

0.10 

0.89 

0.59 

0.96 

0.74 

031 

0.14 

-0.24 

-0.11 

-0.01 

-0.34 

-039 

0.10 

0.13 

0.63 

0.47 

0.41 

0.74 

127 

1.57 

1.14 

1.14 

1.03 

1.09 

033 

-0.10 

-0.40 

-0.40 

-0.47 

-0.61 

-0.67 

-0.03 

-0.0 1 

-0.01 

039 

0.46 

036 

031 

-0.33 



F 
F 



-030 022 
-030 022 
0.10 0.18 
-0.20 0.22 
0.20 0.22 
0.20 0.46 
028 0.46 
0.61 1.69 
1.09 2.19 
2.52 422 
2.80 331 
232 3.05 
2.04 1.95 
1.96 133 
0.881.11 
0.80 1.03 
0.15 0.99 
0.10 1.07 
0.30 1.07 
035 1.26 
0.20 0.91 
020031 
-0.45 1.01 
-0.60 0.65 
-0.60 0.49 
-0.60 0.45 
-030 0.74 
0.79 124 
1.13 0.89 
1.62 1.77 
236X36 
3.40 1.65 . 

3.06 1.48 

2.07 0.76 
1.73 0.53 
0.84 038 
-0.10 0.62 
-0.10 0.84 
0.50 0.94 
1.30 1.70 
2.40 3.29 
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Gin 


78 




* 






T 


C 


1.52 


-0.83 


* 




F 


3.00 3.95 




Ser 


79 










T 


C 


2.11 


-0.67 


* 




F 


2.70 1.82 




Arg 


80 




B 






T 




1.82 


-1.07 


* 




F 


2.20 232 




Glu 


81 


A 


B 






• 




1.52 


-0.96 


* 




F 


1.50133 


5 


Val 


82 


A 


B 






• 




1.24 


-0.97 


* 




F 


1.54 133 




Glu 


83 


A 


B 






• 




136 


-0.87 


* 






1.23 0.98 




Met 


84 


A 


B 










131 


-0.87 


* 






1.77 1.10 




Ser 


85 




* 






T 


C 


131 


-0.44 


* 




F 


2.56 1.47 




Thr 


86 








T 


T 




0.61 


-1.09 






F 


3.40 1.67 


10 


Arg 


87 








T 


T 




1.47 


-030 






F 


2.76 1.46 




Gly 


88 








T 


T 




0.66 


-0.51 






F 


2.72 1.88 




Arg 


89 




B 


B 








0.94 


-0.21 






F 


1.28 1.08 




Phe 


90 




B 


B 








0.90 


-0.21 








0.64 0.79 




Gin 


91 




B 


B 








1.21 


0.21 








-0.30 0.79 


15 


Leu 


92 




B 


B 








0.89 


-0.21 








030 0.68 




Thr 


93 




B 


B 








0.64 


0.21 






F 


034 1.21 




Gly 


94 






B 






C 


0.58 


-0.07 






F 


133 0.70 




Asp 


95 










T 


C 


0.93 


-0.47 


* 




F 


2.22 1.71 




Pro 


96 










T 


C 


0.93 


-0.73 






F 


2.86 1.17 


20 


Ala 


97 








T 


T 




1.08 


-0.81 






F 


3.40 1.90 




Lys 


98 








T 


T 




1.09 


-0.67 






F 


2.91 0.61 




Gly 


99 








T 


T 




0.62 


-0.29 


* 




F 


2.27 0.53 




Asn 


100 




B 




T 


T 




-0.23 


-0.03 






F 


1.93 0.43 




Cys 


101 




B 






T 




-0.91 


0.11 


* 






0.44 0.16 


25 


Ser 


102 




B 






T 




-0.21 


0.80 


* 






-0.20 0.11 




Leu 


103 


A 


B 


B 








-0.26 


037 


* 






-030 0.14 




Val 


104 


A 


B 


B 








-0.50 


-0.03 


* 






030 0.43 




He 


105 


A 


B 


B 








-0.50 


-0.10 


* 






030 033 




Arg 


106 


A 


B 


B 








-0.43 


-0.09 


* 






030 0.68 


30 


Asp 


107 


A 


B 


B 








-0.13 


-0.16 


* 






030 0.91 




Ala 


108 


A 


B 










0.68 


-0.40 


* 






0.45 2^5 




Gin 


109 


A 


B 










1.53 


-1.09 


* 






1.03 1.92 . 




Met 


no 


A 


- 








c 


2.12 


-1.09 








1.51 1.99 




Gin 


111 


A 


B 










2.01 


-0.70 






F 


1.74 2.64 


35 


Asp 


112 








T 


T 




1.77 


-0.80 




* 


F 


2.82 2.64 




Glu 


113 








T 


T 




1.66 


-0.44 


* 




F 


2.80 4.18 




Ser 


114 








T 


T 




0.96 


-0.27 


• 


• 


F 


2.52 2.09 




Gin 


115 








T 


T 




1.67 


0.11 


* 


* 


F 


1.64 1.08 




Tyr 


116 




B 


B 








0.81 


0.11 


* 


* 




0.41 1.22 


40 


Phe 


U7 




B 


B 








0.81 


0.76 


* 






-032 0.68 




Phe 


118 




B 


B 








0.92 


037 


* 


* 




0.04 0.68 
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Arg 


1 10 


B 


Vnl 




B 




111 




Arg 


1 11 

122 




U!y 


123 


* 


Ser 


124 


* 


Tyr 


125 


B 


Val 


126 


B 


Arg 


127 


B 


Tyr 


128 


B 


Asn 
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* 


F 


1.32 135 


5 


Pro 


328 








• 


T 


c 


0.84 


-037 




* 


F 


2.16 1.95 




Pro 


329 








T 






0.86 


-0.19 




* 


F 


2.40 1.20 




Ala 


330 








T 






0.60 


-0.19 




* 


F 


2.16 1.20 




Arg 


331 




B 


B 








0.59 


0.34 




* 


* 


0.42 0.82 




Leu 


332 




B 


B 








0.89 


0.40 




* 


• 


0.180.76 


10 


Ser 


333 




B 


B 








121 


0.37 




* 


• 


0.09 1.30 




Trp 


334 




B 


B 








1.08 


-0.13 




* 




0.45 1.30 




Thr 


335 




B 


B 








1.67 


030 




* 


F 


0.00 1.56 




Gin 


336 




B 






T 




0.70 


0.01 




* 


F 


0.40 2.02 




Arg 


337 




B 






T 




0.70 


0.27 




* 


F 


0.40 1.43 


15 


Gly 


338 




* 




T 


T 




0.70 


0.04 




* 


F 


0.65 0.82 




Gin 


339 




B 






T 




0.78 


-0.06 




* 


F 


0.85 0.63 




Val 


340 




B 










0.79 


-0.03 






F 


0.65 0.50 




Leu 


341 




B 










0.79 


036 






F 


0.05 0.67 




Ser 


342 




B 






T 




0.47 


033 






F 


0.55 0.67 


20 


Pro 


343 








T 


T 




0.51 


036 






F 


1.40 1.41 




Ser 


344 








T 


T 




0.51 


0.10 






F 


1.702.28 




Gin 


345 








• 


T 


c 


1.16 


-0.59 






F 


2.70 2.85 




Pro 


346 








T 






1.62 


-0.54 






F 


3.00 2.85 




Ser 


347 












c 


1.07 


-0.54 






F 


Z50 2.10 


25 


Asp 


348 










T 


c 


0.47 


-0.29 






F 


1.95 0.90 




Pro 


349 




B 






T 




0.77 


0.00 






F 


1.45 0.48 




Gly 


350 




B 






T 




-0.04 


-0.43 






F 


1.15 0.62 




Val 


351 




B 






T 




-0.04 


-0.13 


* 






0.70 031 




Leu 


352 




B 










0.37 


0.30 


* 






-0.10 031 


30 


Glu 


353 




B 










-0.49 


-0.13 


* 






0.50 0.61 




Leu 


354 




B 


B 








-0.28 


0.09 


* 


* 




-030 0.61 




Pro 


355 




B 


B 








-0.79 


-0.16 




* 


F 


0.60 121 




Arg 


356 


A 


B 


B 








0.07 


-020 


* 


• 




030 0,55 




Val 


357 


A 


B 


B 








0.84 


-0.20 








0.45 1.15 


35 


Gin 


358 


A 


B 


B 








0.84 


-039 




• 




0.45 1.01 




Val 


359 


A 


B 


B 








1.31 


-0.81 




* 




0.60 0.89 




Glu 


360 


A 


B 


B 








1.52 


-0.39 




* 




0.45 1.19 




His 


361 


A 










c 


0.71 


-1.03 




* 


F 


1.10 1.19 




Glu 


362 


A 






T 






1.26 


-0.64 




* 


F 


130 139 


40 


Gly 


363 


A 






T 






0.59 


-0.80 




* 


F 


130 1.16 




Glu 


364 


A 






T 






1.41 


-0.23 




* 


F 


0.85 0.46 
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Phe 


365 


A 




Thr 


366 


A A 




Cys 


367 


A 




His 


368 


A 


B 


Ala 


369 


A 




Arg 


370 


A 




His 


371 






Pro 


372 






Leu 


373 






Giy 


374 




• 


Ser 


375 




B 


Gin 


376 




B 


His 


377 




B 


Val 


378 




B 


Ser 


379 




B 


Leu 


380 




B 


Ser 


381 




B 


Leu 


382 




B 


Ser 


383 




B 


Val 


384 




B 


His 


385 




B 


Tyr 


386 




B 


Ser 


387 




B 


Pro 


388 




B 


Lys 


389 






Leu 


390 






Leu 


391 




B 


Gly 


392 






Pro 


393 






Ser 


394 






Cys 


395 






Ser 


396 


A 


- 


Tip 


397 


A 


B 


Glu 


398 


A 


B 


Ala 


. 399 


A 




Glu 


400 


A 




Gly 


401 


A 




Leu 


402 


A 




His 


403 


A 




Cys 


404 






Ser 


405 







254 





T 






0.82 




• 






0.97 




T 






0.93 




• 






1.07 




T 






026 




• 




C 


0.61 




• 


T 


C 


0.62 




T 


T 




129 




T 


T 




129 




T 


T 




1.02 


B 








0.61 


B 








-0.17 


B 








-0.26 


B 








-0.26 


B 








-0.21 


B 








-0.77 


B 








-0.80 


B 








-1.01 


B 








-0.46 


B 








-037 


B 








049 


B 








-0.02 






T 




-0.02 






T 




-0.07 




T 


T 




038 




T 


T 




031 






- 




-0.11 






T 


c 


-0.11 






T 


c 


-0.19 






T 


c 


-0.23 






T 


c 


-0.01 








c 


0.80 










0.80 










0.20 




T 






0.47 




T 






0.47 




T 






0.47 




T 






0.09 




T 






-0.21 




T 


T 




0.08 




T 


T 




0.08 



-0.23 


* * 




0.70 036 


0.27 






-0.30 037 


-0.16 


* * 




0.70 0.41 


0.34 


* * 




-030 0.65 


-0.01 


* * 




0.95 0.70 


0.19 






035 1.07 


0.04 






1.05 0.78 


-0.07 






2.25 1.03 


-0.17 




F 


2.50 0.91 


033 




F 


1.65 0.91 


0.47 




F 


0.30 0,44 


0.43 




F 


0.05 0.71 


0.43 






-0.35 0.59 


039 






-030 039 


0.69 






-0.60 0.28 


0.67 






-0.60 0.28 


0.81. 






-0.60 0.28 


0.67 


♦ 




-0.60 0.28 


1.04 






-0.60 034 


0.74 


* 




-0.60 0.54 


0.79 


* * 




-0.45 1.01 


0.10 


* * 




-0.15 130 


0.40 


♦ 




0.25 1.67 


0.44 


* * 


F 


0.10 1.01 


037 


* * 


F 


0.65 0.64 


0.04 


* 


F 


0.65 0.74 


0.04 




F 


0.05 0.64 


0.19 




F 


0.45 0.17 


0.57 




F 


0.15 0.28 


0.80 




F 


0.15 036 


0.11 


* 




0.300.62 


0.19 


* 




-0.10 0.41 


-0.24 






0.30 0.52 


-020 






0.30 0.97 


-0.09 






0.700.60 


0.03 




F 


0.10 0.77 


-031 






0.70 0.24 


0.07 






0.10 032 


0.14 






0.100.10 


0.53 






0.20 0.13 


0.49 






0.20 0.22 



SUBSTITUTE SHEET (RULE 26) 
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PCT/US99/25031 





Cys 


406 








T 




Ser 


407 








T 




Ser 


408 








T 




Gin 


409 








T 


5 


Ala 


410 












Ser 


411 












Pro 


412 












Ala 


413 












Pro 


414 










10 


Ser 


415 












Leu 


416 




B 








Arg 


417 




B 


B 






Trp 


418 




B 


B 






Trp 


419 


A 


. 


B 




15 


Leu 


420 


A 




B 






Gly 


421 


A 




B 






Glu 


422 


A 


B 








Glu 


423 


A 


B 








Leu 


424 


A 








20 


Leu 


425 


A 










Glu 


426 


A 






T 




Gly 


427 


A 






T 




Asn 


428 








T 




Ser • 


429 










25 


Ser 


430 












Gin 


431 












Asp 


432 


A 










Ser 


433 


A 


B 








Phe 


434 


A 


B 






30 


Glu 


435 


A 


B 








Val 


436 


A 


B 








Thr 


437 




B 








Pro 


438 












Ser 


439 








T 


35 


Ser 


440 












Ala 


441 












Gly 


442 












Pro 


443 








T 




Trp 


444 










40 


Ala 


445 




B 








Asn 


446 




B 







255 



T 




-0.17 


0.20 






. 


0.50 0.27 


T 




0.34 


020 




* 


F 


0.65 032 






0.17 


0.01 






F 


0.45 0.52 






0.24 


0.06 






F 


0.60 1.49 




C 


033 


-0.01 






F 


1.24 1.13 




C 


0.70 


0.03 






F 


0.88 130 




c 


0.19 


0.03 




* 


F 


1.12 1.00 


T 


c 


0.60 


031 




* 


F 


1.41 0.82 


T 


c 


0.31 


-0.19 


* 


* 


F 


2.40 1.20 


T 


c 


0.61 


0.34 


* 


* 


F 


1.41 0.82 


T 




0.10 


0.83 


* 


* 




0.52 0.85 






-0.03 


1.01 


* 


« 




-0.12 0.45 




. 


0.56 


1.01 




* 




-036 033 




c 


0.77 


0.63 




* 




-0.40 0.70 




c 


0.26 


-0.06 




* 




0.50 0.62 




c 


0.26 


0.63 








-0.40 0.49 






0.14 


0.40 




* 


F 


-0.15 0.38 






0.09 


-0.51 






F 


0.75 0.80 




c 


038 


-0.77 


* 




F 


0.95 0.80 




c 


0.89 


-0.80 






F 


1.25 0.74 






0.93 


-0.41 






F 


1.45 0.58 






0.93 


-0.03 






F 


1.75 0.94 


T 




0.93 


-031 






F 


2.60 1.97 


T 


c 


1.44 


-1.00 






F 


3.00 1.89 


T 


c 


1.56 


-0.61 






F 


2.70 2,57 


T 


c 


1.56 


-0.26 






F 


2.10 138 




c 


1.04 


-0.66 


* 




F 


1.70 1.79 






0.73 


-0.40 


* 




F 


0.75 0.99 






0.82 


-030 








0.30 o;82 






0.82 


-0.27 








0.43 0.76 






0.52 


0.11 






F 


0.11 0.76 


T 




-0.07 


0.11 






F 


0.79 1.18 


T 


c 


-0.11 


-0.17 






F 


1.57 0.69 


T 




0.38 


0.26 






F 


130 0.92 


T 


c 


0.09 


0.04 






F 


0.97 0.98 




c 


0.36 


0.47 






F 


0.34 0.67 


T 


c 


0.67 


0.54 






F 


0.41 0.50 


T 




0.58 


0.56 






F 


0.48 0.61 


T 


c 


0.58 


0.56 


* 




F 


0.15 0.80 


T 




0.07 


0.44 


* 




F 


0.10 1.09 


T 




036 


0.70 






F 


-0.05 0.58 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



256 



PCT/US99/2503I 



10 



15 



20 



25 



30 



35 



40 



Ser 


447 




B 


Ser 


448 




B 


Leu 


449 




B 


Ser 


450 




B 


Leu 


451 




B 


His 


452 




B 


Gly 


453 






Gly 


454 






Leu 


455 






Ser 


456 






Ser 


457 




. 


Gly 


458 




B 


Leu 


459 




B 


Arg 


460 


A 


B 


Leu 


461 


A 


B 


Arg 


462 


A 


B 


Cys 


463 


A 


B 


GIu 


464 


A 


B 


Ala 


465 


A 




Trp 


466 


A 


. 


Asn 


467 




B 


Val 


468 




B 


His 


469 






Gly 


470 






Ala 


471 






Gin 


472 






Ser 


473 






Gly 


474 




B 


Ser 


475 




B 


He 


476 




B 


Leu 


477 




B 


Gin 


478 




B 


Leu 


479 




B 


Pro 


480 




B 


Asp 


481 






Lys 


482 






Lys 


483 






Gly 


484 




B 


Leu 


485 




B 


He 


486 




B 


Ser 


487 




B 



B 
B 
B 
B 
B 



T 
T 



T 
T 
T 



T 
T 
T 

T 
T 
T 
T 

T 
T 
T 
T 



T 
T 
T 
T 

T 
T 
T 
T 



T 
T 
T 
T 



-0.11 

0.07 

0.01 

0.26 

-0.56 

-0.56 

-0.56 

-0.09 

-0.60 

0.32 

-0.46 

0.00 

-032 

0.49 

0.20 

0.21 

0.56 

0.51 

037 

0.83 

0.13 

0.80 

0.50 

0.74 

0.73 

-0.16 

-0.11 

-0.08 

-0.54 

-0.17 

-0.17 

0.18 

0.57 

032 

0.60 

0.52 

0.22 

0.72 

034 

-036 

-0.70 



0.66 

0.43 

0.43 

0.46 

0.50 

0.80 

0.50 

0.50 

0.24 

0.43 

0.00 

0.26 

-0.43 

-0.24 

-0.63 

-0.56 

-033 

0.07 

0.03 

0.53 

039 

0.89 

0.79 

0.26 

0.29 

0.03 

0.21 

0,47 

037 

0.66 

0.70 

0.27 

-0.11 

-0.80 

-1.06 

-0.77 

-0.77 

-0.81 

-033 

0.17 

0.96 



F 
F 
F 
F 
F 



F 
F 
F 
F 
F 
F 



-0.05 0.74 
-0.05 0.60 
-0.20 0.51 
-0.40 038 
-0.20 028 
-0.20 0.28 
0.20 0.28 
0.15 0.45 
039 033 
0.43 0.27 
1.67 0.54 
0.81 034 
1.40 0.79 
0.86 032 
1.02 0.56 
0.88 0.68 
0.44 037 
-030 0.71 
0.10 0.27 
-0.20 0.68 
0.10039 
-0.20 039 
0.00 0.64 
0.30 034 
025 0.72 
0.45 0.71 
0.45 0.50 
-0.05 0.41 
025 0.41 
-0.40 0.25 
-0.06 0.39 
038 0.49 
1.87 139 
2.66338 
3.40 1.93 
3.06 1.93 
2.17 0.88 
1.43 0.70 
0.79 0.51 
-0.30 0.26 
-0.60 0.22 



SUBSTITUTE SHEET (RULE 26) 
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257 



Thr 


488 






B 


B 








-0.74 


0.91 


* 




-0.60 0.36 


Ala 


489 






B 


B 




. 




-0.74 


0.63 


* 


. 


-0.60 0,84 


Phe 


490 






B 






T 




-0.52 


0.37 




F 


0.25 0.62 


Ser 


491 












T 


C 


-033 


0.49 


* 


F 


0.15 0.43 


Asn 


492 












T 


c 


-0.84 


0.79 




F 


0.15 0.37 


Gly 


493 






t 






T 


c 


-0.88 


0.97 




F 


0.15 0.35 


Ala 


494 








B 






c 


-1.18 


0.61 






-0.40 0.26 


Phe 


495 






B 


B 








-0.82 


0.91 






-0.60 0.11 


Leu 


496 






B 


B 








-Ml 


0.94 






-0.60 0.11 


Gly 


497 






B 


B 








-1.72 


UO 






-0.60 0.08 


He 


498 






B 


B 








-1.97 


1.19 






-0.60 0.13 


Gly 


499 






B 


B 








-2.19 


0.90 






-0.60 0.16 


He 


500 




A 


B 










-230 


0.90 






-0.60 0.13 


Thr 


501 




A 


B 










-2.19 


1.16 






-0.60 0.16 


Ala 


502 




A 


B 










-2.66 


1.26 






-0.60 0.14 


Leu 


503 




A 


B 










-2.43 


1.51 






-0.60 0.16 


Leu 


504 




A 


B 










-2.90 


1.40 






-0.60 0.06 


Phe 


505 




A 


B 










-2.60 


1.60 






-0.60 0.05 


Leu 


506 




A 


B 










-3.10 


1.60 






-0.60 0.06 


Cys 


507 


A 


A 












-3.40 


1.60 






-0.60 0.06 


Leu 


508 


A 


A 












-3.48 


1.60 






-0.60 0.05 


Ala 


509 


A 


A 












-3.27 


1.50 


* 




-0.60 0.04 


Leu 


510 


A 


A 












-2.52 


1.43 


* 




-0.60 0.08 


lie 


511 




A 


B 










-2.60 


0.86 


* 




-0.60 0.19 


He 


512 




A 


B 










-2.74 


0.86 


* 




-0.60 0.13 


Met 


513 




A 


B 










-2.14 


1.04 


* 




-0.60 0.13 


Lys 


514 




A 


B 










-1,51 


0.79 


* 




-0.26 0.28 


He 


515 




A 


B 










-0.59 


0.10 






0380.81 


Leu 


516 




A 


B 










0.41 


-059 


i 




1.77 1.61 


Pro 


517 












T 


c 


0.99 


-1.20 




F 


2.86 157 


Lys 


518 










T 


T 




1.59 


-0.71 


i 


* F 


3.40 3.24 


Arg 


519 










T 


T 




1.23 


-1.00 


. * F 


3.06 6.80 . 


Arg 


520 










T 


T 




2.12 


-1.20 




F 


2.72 6.34 


Thr 


521 










T 






Z62 


-1.63 




F 


2.18 5.49 


Gin 


522 






B 










2.62 


-1.14 


* 


F 


1.44 4.05 


Thr 


523 






B 










2.69 


-0.71 


* 


F 


1.10 3.20 


Glu 


524 






B 










2.37 


-0.71 


* 


* F 


1.10 4.34 


Thr 


525 






B 






T 




2.37 


-0.77 




* F 


1.30 3.87 


Pro 


526 












T 


c 


1.98 


-1.17 




F 


1.50 5.26 


Arg 


527 












T 


c 


1.68 


-0.87 


* 


* F 


1.84 2.63 


Pro 


528 










T 


T 




2.10 


-0.49 


* 


F 


2.08 2.44 
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Arg 


529 




Phe 


530 




Ser 


531 




Arg 


532 


B 


His 


533 


B 


Ser 


534 


B 


Thr 


535 


B 


tie 


536 


B 


Leu 


537 


B 


Asp 


538 


B 


Tyr 


539 


B 


lie 


540 


B 


Asn 


541 


B 


Val 


542 


B 


Val 


543 


B 


Pro 


544 


B 


Thr 


545 


B 


Ala 


546 


B 


Gly 


547 


B 


Pro 


548 


B 


Leu 


549 


B 


Ala 


550 


B 


Gin 


551 


B 


Lys 


552 


B 
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1.81 
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1.24 

1.26 

1.47 

0.57 

0.96 

0.57 

0.57 

0.77 

038 

0.23 

0.43 

0.04 



1.11 

1.04 

0.71 

0.86 

037 

0.11 

-0.29 

-0.64 

-1.07 
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-1.03 
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Table VI 



Res Position 


I 


II 


in 


IV 


V 


VI 


VII 


VIII 


IX 


X XI XII 


XIII XI 


Met 


1 




A 


B 










-1.90 


0.96 




-0 60 021 


Leu 


2 




A 


B 










-232 


1.21 




-0.60 0.13 


Leu 


3 




A 


B 










-2.74 


1.47 




-0 60 0 09 


Leu 


4 




A 


B 










-2.57 


1.73 




-0.60 0.07 


Leu 


5 




A 


B 










-2.99 


1.54 




-0.60 0.14 


Leu 


6 




A 


B 










-3.20 


1.54 




-0.60 0.14 


Leu 


7 




A 


B 










-Z68 


1.54 




-0.60 0.14 


Pro 


8 




A 


B 










-2.21 


1.77 




-0.60 0.17 


Leu 


9 




A 


B 










-1.29 


1.51 




-0.60 0.21 


Leu 


10 




A 


B 










-0.48 


0.83 




-0.60 0.49 


Tip 


11 




A 










C 


0.44 


0.14 




-0.10 035 


Gly 


12 




A 










C 


0.40 


-0.29 


* F 


0.80 1.31 


Arg 


13 




A 










C 


0.61 


-033 


* F 


0.80 1.18 


Glu 


14 




A 


B 










1.08 


-1.01 


* F 


0.90 1.94 


Arg 


15 




A 


B 










1.89 


-1.50 


. * F 


1.24 1.94 


Val 


16 




A 


B 










2.22 


-1.53 


. * F 


1.58 1.71 


Glu 


17 




A 






T 






2.27 


-1.53 


. * F 


232 1.98 


Gly 


18 










T 






2.16 


-1.14 


* * F 


2.86 135 


Gin 


19 










T 


T 




2.27 


-0.74 


. * F 


3.40 2.93 


Lys 


20 












T 


C 


2.20 


-139 


. * F 


2.86 332 


Ser 


21 










. 


T 


c 


3.06 


-139 


. . F 


2.86 6.70 


Asn 


22 










T 


T 




2.81 


-1.81 


* . F 


3.06 6.46 


Arg 


23 










T 


T 




2.86 


-1.46 


* . F 


3.06 5.06 


Lys 


24 










T 


T 




2.04 


-1.07 


* . F 


3.06 5.06 


Asp 


25 










T 


T 




1.69 


-0.77 


* . F 


3.40 2.60 


Tyr 
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B 






T 




1.39 


-0.69 




2.51 1.91 


Ser 


27 






B 


B 








1.39 


-0.07 




1.32 0.95 . 


Leu 


28 






B 


B 








0.98 


0.33 




038 0.98 


Thr 


29 






B 


fi 








0.63 


0.71 




-0.26 0.84 


Met 


30 






B 






T 




-0.22 


0.34 




0.10 0.84 


Gin 


31 






B 






T 




-0.29 


0.60 


. * F 


-0.05 0.76 


Ser 


32 






B 






T 




-0.84 


0.40 


. * F 


0.25 0.76 


Ser 


33 






B 






T 




-0.03 


0.56 


. * F 


-0.05 0.57 


Val 


34 






B 


B 








0.28 


0.34 


. * F 


-0.15 0:57 


Thr 


35 






B 


B 








0.53 


-0.06 


. * F 


0.45 0.73 
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-0.01 


. . F 


0.45 0.54 
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132 0.76 


Trp 


74 


A 


B 








0.41 
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-0.60 0.75 
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-0.49 
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-0.30 036 


Pro 


77 




B 








0.40 


0.47 
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020 0.45 
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• 




• 


-0.30 0.55 






0.66 


-0.04 


* 
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0.30 0.94 






0.98 


-0.06 
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F 


0.94 1.29 






1.57 


-0.97 
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1.58 2.50 






1.97 


-1.47 
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1.92 5.62 


T 
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-2.11 
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3.06 6.36 


T 
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2.17 1.43 






0.71 


-0.34 




* 




0.98 0.60 






1.02 


0.09 


* 


* 




0.04 036 




c 


0.70 


0.09 


• 


* 




-0.10 0.30 






0.70 


0.51 


- 






-0.20 0.54 






0.28 


0.13 


* 




F 


0.59 0.69 




c 


1.02 


0.11 


* 


* 


F 


1.08 1.21 






1.06 


-0.57 






F 


2.52 2.93 






1.20 


-0.86 




* 


F 


2.86 4.76 


T 




1.70 


-0.71 






F 


3.40 1.53 


T 




1.23 


-0.23 






F 


2.36 1.43 


T 




0.93 


0.03 






F 


127 0.68 


T 




0.26 


0.01 


* 


* 




0.78 0.63 






0.37 


0.21 


* 


* 




0.04 0.22 






0.68 


0.21 


* 


* 




-0.30 0.27 






0.04 


-0.19 


* 


* 




030 0.84 






0.16 


-0.26 


* 


* 




030 0.59 






0.22 


-0.74 


* 


* 


F 


0.90 139 






0.23 


-0.81 


* 




F 


0.90 1.03 






1.04 


-0.81 


* 




F 


0.90 1.97 






0.76 


-1.50 


* 






1.03 1.68 






1.30 


-1.00 




* 


F 


1.46 1.01 




c 


1.30 


-0.57 




* 


F 


1.79 0.99 


T 




1.06 


-1.07 


* 


* 


F 


2.67 0.99 


T 




0.94 


-0.31 


* 


* 


F 


2.80 1.57 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



Gly 


119 




• 


Arg 


120 


B 


• 


Tyr 


121 


B 


B 


Phe 


122 


B 


B 


Phe 


123 


B 


B 


Arg 


124 


B 


B 


Met 


125 


B 




Glu 


126 






Lys 


127 






Gly 


128 






Asn 


129 






He 


130 






Lys 


131 






Tip 


132 






Asn 


133 


B 




Tyr 


134 


B 




Lys 


135 


B 




Tyr 


136 


• 




Asp 


137 


• 




Gin 


138 


B 


B 


Leu 


139 


B 


B 


Ser 


140 


B 


B 


Val 


141 


B 


B 


Asn 


142 


B 


B 


Val 


143 


B 


B 


Thr 


144 


B 


B 


Ala 


145 


fi 


B 


Leu 


146 


B 


B 


Thr 


147 


B 


B 


His 


148 


B 


B 


Arg 


149 


B 




Pro 


150 


B 




Asn 


151 


• 




He 


152 


B 




Leu 


153 


B 




Ue 


154 


B 




Pro 


155 


B 




Gly 


156 


B 




Thr 


157 


B 




Leu 


158 


B 




Glu 


159 


B 





T 



264 



T 


. 0.13 


-0.14 * 


* F 


2.52 1.02 


T 


. 0.84 


0.26 * 




0.94 0.53 




. 0.54 


0.26 * 




0.41 1.02 




. 0.54 


0.37 • 




0.13 1.02 




. 1.18 


-0.06 * 




0.30 0.90 




. 1.18 


-0.06 * 




0.79 1.15 




1.07 


-0.39 * 




133 1.32 


T 


0.42 


-0.77 * 


* F 


2.72 2.45 


T 


. 1.17 


-0.87 * 


* F 


2.91 0.88 


T 


. 1.58 


-0.87 * 


* F 


3.40 1.77 


T 


. 1.47 


-0.57 * 


* F 


3.06 1.08 




C 1.82 


-0.17 . 


* F 


1.87 0.86 




. 1.87 


0.59 . 




0.83 137 


. 


. 1.58 


0.16 




0.79 1.70 


T 


. 1.92 


0.51 . 




0.19 3.81 


T 


1.92 


-0.17 . 




133 3.18 


T 


2.00 


023 * 




0.97 5.23 


T 


. 1.66 


0.00 . 


* F 


236 2.68 




. 1.09 


-0.01 


* F 


2.40 2.30 




1.09 


•0.13 


* F 


1.41 0.85 




0.48 


0.27 . 




0.42 0.87 




0.12 


0.16 




0.18 0.39 




. -0.22 


0.64 * 




-0.36 032 




. -1.03 


0.74 




-0.60 0.40 




. -134 


0.74 




-0.60 0.24 




- -0.57 


0.84 * 




-0.60 0.47 




-0.16 


0.70 . 




-0.60 0.40 




0.49 


0.30 




-0.15 1.06 




0.49 


0.09 . 




-0.15 1.14 




. 0.46 


0.00 




0.45 1.81 


T 


-0.04 


0.19 




025 1.54 


T 


. -0.34 


0.19 




0.10 0.88 


T 


. 0.26 


039 




0.500.45 


T 


0.22 


031 




0.10 0.36 




. -0.06 


0.74 




-0.40 0.23 


T 


-0.98 


0.80 * < 




-0.20 0.21 


T 


-0.77 


1.09 . < 


* F 


-0.05 0.24 


T 


-1.07 


0.40 . < 


F 


0.25 0.51 


T 


-0.52 


0.10 . . 


F 


0.25 0.97 




-0.38 


-0.16 . . 


F 


0.65 0.62 


T 


. -0.19 


-0.0 1 


F 


0.85 0.34 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



Ser 


160 






T 

i 


Giy 


161 






T 


Cvs 


162 






T 
1 


Phe 


1UJ 


n 
15 


m 
D 








rj 
D 


B 




nail 






B 


T 


1 mi 
i-cu 


lob 


B 


B 


■ 


i nr 


10/ 




B 


T 


Cys 


1 £0 

168 




B 


T 


OCT 


ioy 




B 


T 


Val 
Vol 


I7n 
1 /u 




B 


* 


rro 


171 






T 


lip 


IT? 
1 /i 






T 


Ala 


1*7-3 


B 




* 


i*ys 


i I** 


B 




* 




17^ 
It J 






T 


ftln 

VJIII 


17A 

1 /o 






T 


uiy 


177 
1 / / 






T 


i nr 


17Q 

1 to 








rTO 


no 

i /y 








rTO 


180 




B 




Met 


181 


B 


B 




11a 

lie 


182 


B 


B 




Ser 


183 


B 


B 




Trp 


184 


B 


B 




Ma* 

Met 


1 oc 

185 


B 


B 




vjiy 


loo 




B 


T 


i nr 


15/ 


* 


B 




ocr 


Ififi 

loo 


* 


B 




Val 
v ai 


ISO 
107 


B 


B 




Ser 


ion 

17V 


* 






Pro 


1Q1 

17 J . 








Leu 


iqo 








Hie 
MIS 


\yj 


B 






Pro 


194 


B 






Ser 


195 






T 


Thr 


196 








Thr 


197 


B 






Arg 


198 


B 






Ser 


199 


B 






Ser 


200 


B 







PCT/US99/25031 

265 



T 




o no 


ft 1A 




r 


0.65 0.20 


T 
i 




n 17 
uo/ 


ft 7/i 




r 


0.61 0.42 


T 
1 






ft a*i 

-0.03 






1.02 0.39 






U.J/ 


O.oo 






-0.72 0.24 






-0.30 


0.76 






-0.76 035 






-0.30 


0.90 


* 




-0.40 035 






-0.81 


0.71 


* 




-0.76 055 






-036 


0.57 






-032 0.23 






0.06 


0.60 


* 




-0.28 0.22 






-0.53 


1.11 


* 




-0.24 0.29 




C 


-1.20 


0.93 


* * 




-0.40 0.20 






-039 


1. 01 


* 




0.00 0.20 






-0.08 


0.44 






0.00 0.26 






0.24 


0.46 






-0.12 0.61 






0.23 


0.24 






0.46 039 


T 




0.88 


0.30 




F 


1.49 0.53 


T 




0.88 


-0.19 




F 


237 0.81 


T 




0.57 


-0.26 




F 


2.80 235 


T 


C 


0.27 


-0.21 




F 


232 134 




c 


0.63 


0.47 


* 


F 


0.79 0.54 




c 


0.34 


0.46 


* 


F 


031 0.74 






-0.26 


0.94 


* 




-032 0.54 






-0.26 


1.07 






-0.60 034 






-0.26 


1.07 


• 




-0.60 022 






-0.34 


1.13 


* 




-0.60 032 






-0.99 


0.90 


* 




-0.60 0.61 






-0.69 


0.86 


* 




-0.20 0.34 




c 


-0.01 


0.86 




F 


-025 0.43 




c 


-0.52 


037 




F 


0.05 0.67 






-0.27 


0.44 




F 


-0.45 056 






0.12 


0.51 




F 


-0.05 053 






0.17 


0.46 




F 


-0.05 0.61 






ft IT 

0.17 


0.46 


• 


F 


0.34 1.10 


T 




0.16 


0.30 


* 


F 


0.88 1.19 


T 




1.12 


0.40 


* 


F 


0.82 1.11 


T 




1.12 


-0.03 


* 


F 


236 2.64 


T 


c 


1.03 


-0.33 




F 


2.40 2.60 


T 




0.99 


-0.44 




F 


1.96 2.25 


T 




0.21 


-0.23 




F 


1.72 1.25 


T 




0.11 


0.07 




F 


0.73 0.71 


T 




-0.40 


0.07 




F 


0.49 0.71 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



Val 


201 


B 


g 


Leu 


202 


D 
O 


D 
D 


Thr 


203 . " 


D 


D 
D 


Leu 


204 


D 


B 


lie 


205 


n 

D 


B 


Pro 


206 


n 
D 


B 


Gin 


207 


t> 
D 




Pro 


208 


t> 




Gin 


209 






His 


210 






His 


211 






Gly 


212 






Thr 


213 






Ser 


214 




B 


Leu 


215 


fci 


B 


Thr 


216 


B 


B 


Cvs 


217 


B 


B 


Gin 


218 


B 


B 


Val 


219 


B 


B 


Thr 




B 


B 


Leu 




B 




Pro 


999 


B 




Glv 




* 




Ala 


224 


* 




Glv 


225 


B 




Val 


226 


B 




Thr 


227 


B 




Thr 


228 


B 




Asn 


229 


B 




Are 


230 


b 




Thr 


231 


n 
D 


B 


lie 


232 


B 


B 


Gin 


233 


B 


B 


Leu 




B 


B 


Asn 


235 


B 




Val 


236 


B 




Ser 


237 






Tyr 


238 






Pro 


239 






Pro 


240 






Gin 


241 







T 



PCT/US99/25031 

266 







-0.98 


0.27 






-0.30 0.30 






-0.88 


0.96 






-0.60 0.16 




* 


-0.99 


1.00 


* • 




-0.60 0.18 






-0.90 


1.01 






-0.60 0.42 




* 


-0.60 


0.80 


* 




-0.60 0.79 






0.22 


031 


• 


F 


-0.45 0.95 






1.00 


033 


* 


F 


-0.10 137 




* 


0.97 


034 


* 


F 


0.20 3.04 




* 


1.47 


0.09 


* 


F 


0.60 1.95 


T 


• 


2.06 


0.14 




F 


0.80 1.62 


T 


• 


1.46 


0.13 


• • 


F 


0.80 1.41 


T 


* 


1.14 


039 


• • 


F 


0.65 0.67 


T 


* 


0.69 


0.47 


• • 


F 


035 0.71 




• 


0.69 


0.54 


* 


F 


-0.05 0.28 




* 


-0.13 


0.44 


* 




-0.60 0.49 






-0.41 


0.66 


* 




-0.60 0.25 






-0.88 


0.66 


* 




-0.60 0.27 




■ 


-0.78 


0.96 


* 




-0.60 0.27 




• 


-0.82 


0.70 


* 




-0.60 0.29 




• 


-0.60 


0.64 


* 




-0.60 0.54 


T 


• 


-0.63 


037 






-0.20 033 


T 


• 


-0.82 


0.60 




F 


-0.05 0.42 


T 




-1.13 


0.60 




F 


035 0.21 


T 


C 


-0.59 


0.60 . 




F 


035 0.38 






-0.28 


0.40 < 




F 


0.15 035 




• 


0.64 


037 < 


* 


F 


0.65 037 


T 


• 


034 


-0.06 < 


> 


F 


1.80 1.10 


T 


- 


0.00 


-0.07 * . 


F 


2.00 1.61 


T 


* 


039 


0.19 * . 


F 


1.20 1.52 


T 


• 


0.12 


-0.06 . 




F 


1.60 1.83 




* 


0.98 


0.14 


* 


F 


0.40 1.04 






0.43 


0.06 * 


* 




0.05 1.04 






0.44 


030 • 






-030 0.40 






0.20 


0.69 • 


* 




-0.60 037 


T 




-0.12 


0.96 * 






-0.20 0.82 


T 




-0.02 


0.70 






-0.20 0.73 


T 


c 


0.87 


0.73 


* 




0.15 137 


T 


c 


0.87 


0.44 


* 


F 


030 1.48 




c 


0.87 


0.44 


• 


F 


0.10 3.21 


T 




0.56 


0.49 




F 


0.50 1.97 


T 




0.56 


039 




F 


0.50 1.82 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



267 





Asn 


242 




B 


• 




T 




0.54 


0.47 






F 


-0.05 0.87 




Leu 


243 




B 


* 




T 




-0.07 


033 








-020 0.81 




Thr 


244 




B 


B 








-0.56 


0.74 


* 






-0.60 035 




Val 


245 




B 


B 








-034 


1.13 


* 






-0.60 0.19 


5 


Thr 


246 




B 


B 








-0.69 


1.13 








-0.60 0.39 




Val 


247 




B 


B 








-0.69 


0.87 




* 




-0.60 0.27 




Phe 


248 




B 


B 








-0.22 


0.39 








-030 0.63 




Gin 


249 




B 


B 








-0.22 


0.17 






F 


0.09 0.43 




Gly 


250 




• 






T 


C 


0.04 


0.17 






F 


0.93 0.84 


10 


Glu 


251 




• 






T 


c 


0.06 


0.03 






F 


1.17 0.98 




Gly 


252 




- 






T 


c 


0.60 


-037 






F 


2.01 0.76 




Thr 


253 










T 


c 


0.71 


-0.29 






F 


2.40 1.11 




Ala 


254 




B 










-0.10 


-0.21 






F 


1.61 0.65 




Ser 


255 




B 










-0.10 


0.47 






F 


0.47 034 


15 


Thr 


256 




B 










-0.10 


0.47 






F 


0.23 037 




Ala 


257 




B 










-0.06 


039 






F 


029 039 




Leu 


258 




B 










-0.04 


0.27 






F 


0.05 039 




Gly 


259 








T 






0.24 


0.27 






F 


0.45 0.55 




Asn 


260 










T 


c 


-0.27 


0.17 






F 


0.45 0.72 


20 


Ser 


261 










T 


c 


-0.26 


036 






F 


0.45 0.72 




Ser 


262 










T 


c 


-0.52 


0.06 






F 


0.45 0.98 




Ser 


263 










T 


c 


-0.52 


0.27 






F 


0.45 0.45 




Leu 


264 


A 


B 










-0.18 


036 






F 


-0.45 0.28 




Ser 


265 


A 


B 










-0.52 


0.17 








-030 036 


25 


Val 


266 


A 


B 










-0.22 


0.21 








-030 0.27 




Leu 


267 


A 


B 










-0.22 


0.23 








-0.13 036 




Glu 


268 


A 


B 










-0.73 


-0.07 


* 




F 


0.79 0.56 




Gly 


269 




* 




T 


T 




0.19 


0.23 


* 




F 


1.16 0.62 




Gin 


270 








T 


T 




-032 


-0.41 


* 




F 


2.08 1.47 


30 


Ser 


271 




B 






T 




-0.32 


-0.41 


* 




F 


1.70 0.70 




Leu 


272 




B 






T 




-0.18 


0O3 


* 




F 


0.93 0.53 




Arg 


273 




B 


B 








-0.77 


037 


* 






0.21 0.16. 




Leu 


274 




B 


B 








-1.28 


0.47 


* 


* 




-0.26 0.12 




Val 


275 




B 


B 








-1.28 


0.73 


* 


* 




-0.43 0.11 


35 


Cys 


276 




B 


B 








-U8 


0.04 


* 


* 




-0.30 0.09 




Ala 


277 




B 


B 








-0.47 


0.43 


* 


* 




-0.60 0.15 




Val 


278 




B 


B 








-0.79 


0.14 


• 


* 




0.00 033 




Asp 


279 








T 


T 




-0.19 


-0.07 






F 


1.85 0.96 




Ser 


280 










T 


c 


0.08 


-0.21 




* 


F 


2.10 1.46 


40 


Asn 


281 










T 


c 


0.86 


-0.21 




* 


F 


2.40 1.99 




Pro 


282 










T 


c 


0.63 


-0.86 




* 


F 


3.00 2.34 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



268 



Pro 


283 






■ 


T 






1.19 


-0.17 




* 


F 


2.40 1.44 


Ala 


9R4 






* 


T 






0.90 


-0.17 




* 


F 


2.101.20 


Arg 






B 


B 








0.89 


034 


* 






030 0.82 


Ijmi 






B 


B 








0.60 


0.40 


♦ 


* 




-030 0.76 


3 Ci 


Zo/ 




B 


B 








0.92 


0.89 


* 


* 




-0.60 0.79 


irp 


ZOO 




B 


B 








0.83 


039 


* 


* 




-030 0.79 


inr 


ion 




B 


B 








0.61 


0.77 


* 






-0.45 1.29 


Tip 


290 




B 


B 








0.19 


0.77 


* 


* 




-0.60 0.79 


Arg 


291 




B 


B 








0.19 


0.87 


* 


* 




•0.45 1.09 


oer 


292 




B 


B 








0.24 


0.64 


* 






-0.60 0.62 


i <».t 

I-CU 


293 






B 


T 






0.32 


0.91 








-0.20 0.93 


Inr 


294 






B 


. T 






0.33 


0.43 


« 






-020 0.73 


Leu 


one 






B 


* 






0.62 


0.81 


♦ 






-0.40 0.73 




29o 




B 




• 


T 




0.30 


0.83 


* 




F 


0.10 1.53 


rTO 


297 








T 


T 




0.30 


0.57 


• 




F 


0.62 1.64 


OCT 


298 








T 


T 




1.11 


0.47 






F 


0.74 2.67 


Oin 


299 










T 


C 


1.21 


0.19 


• 




F 


0.96 2.74 


Pro 


300 








T 


T 




1.21 


^..-0.14 


• 




F 


1.88 2.74 


oer 


301 










T 


C 


0.60 


0.11 


• 




F 


1.20 1.69 


Ash 


302 










T 


C 


0.00 


037 


• 




F 


0.93 0.72 


Pro 


303 




B 






T 




0.30 


0.66 


* 




F 


031 0.39 


Leu 


304 


A 


B 










-0.51 


023 


* 


* 


F 


0.09 0.50 


Val 


305 


A 


B 










-0.30 


0.53 








-0.48 0.26 


Leu 


306 


A 


B 










-0.86 


0.53 


• 


* 




-0.60 0.29 


GIu 


307 


A 


B 










-0.89 


0.74 


• 


m 




-0.60 0.26 


Leu 


308 


A 


B 










-1.49 


0.56 


* 


* 




-0.60 0.47 


Gin 


309 


A 


B 










-1.02 


0.60 


* 


* 




-0.60 0.47 


val 


310 


A 


B 










-0.17 


0.34 


* 


* 




-0.30 027 


HIS 


311 


A 


B 










0.64 


0.34 


* 


* 




-0.30 035 


Leu 


312 


A 










C 


0.30 


-034 


* 


* 




0.84 035 




313 


* 








T 


C 


1.11 


-031 


• 


* 


F 


1.73 0.73 


Asp 


314 


* 






T 


T 




0.41 


-0.96 


• 


* 


F 


237 0.93 


VjlU 


315 


* 






T 


T 




0.96 


-0.67 


* 


* 


F 


2.91 0.97 


vjiy 


3lo 








T 


T 




0.32 


-0.87 


* 


* 


F 


3.40 1.42 


Glu 


317 


A 






T 






124 


-0.73 


* 


* 


F 


2.510.46 


Phe 


318 


A 


B 










1.00 


-0.73 




* 




1.62 0.52 


Thr 


319 


A 


B 










1.00 


-0.23 


* 






0.98 0.53 


Cys 


320 


A 


B 










1.00 


-0.26 


* 


* 




0.89 0.53 


Arg 


321 


A 






T 






1.04 


0.14 


* 


* 




0.60 0.98 


Ala 


322 


A 






T 






0.23 


-0.26 


* 


* 


F 


1.60 0.91 


Gin 


323 


A 






T 






0.59 


-0.06 


* 


* 


F 


2.00 1.40 
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Asn 


324 






• 


T 


T 




0.60 


-0.20 


* * 


F 


2.500.71 


Ser 


325 










T 


C 


1.27 


0.19 


* * 


F 


1.45 0.94 


Leu 


326 




* 


- 




T 


C 


1.12 


0.09 


* * 


F 


1.200.94 


Gly 


327 




• 




T 


T 




0.86 


0.19 




F 


1.15 0.79 


Ser 


328 




B 


B 








0.56 


0.43 




F 


•0.20 0.44 


Gin 


329 




B 


B 








-0.26 


0.43 




F 


-0.45 0.71 


His 


330 




B 


B 








0.04 


0.43 


. * 




-0.60 0.59 


Val 


331 




B 


B 








0.04 


0.40 


. * 




-0.60 0.71 


Ser 


332 


A 


B 










0.09 


0.70 


. * 




-0.60 0.34 


Leu 


333 


A 


B 










-0.42 


0.69 


. * 




-0.60 033 


Asn 


334 


A 


B 










-0.42 


0.87 


. » 




-0.60 037 


Leu 


335 


A 










c 


-039 


0.63 






-0.40 0.48 


Ser 


336 


A 


. 








c 


0.47 


0.64 






-0.25 1.01 


Leu 


337 


A 


B 










0.52 


-0.04 






0.45 1.09 


Gin 


338 


A 


B . 










1.02 


0.31 


. * 


F 


0.00 2.06 


Gin 


339 


A 


B 










0.68 


0.11 


. * 


F 


034 2.22 


Glu 


340 


A 


B 










1.53 


0.16 


. * 


F 


0.68 2.66 


Tyr 


341 




B 






T 




1.23 


-0.53 


* * 


F 


232 3.08 


Thr 


342 








T 


T 




2.16 


-031 


* * 


F 


2.76 1.76 


Gly 


343 








T 


T 




1.94 


-0.71 


* * 


F 


3.40 1.99 


tys 


344 








T 


T 




1.09 


-0.29 


* * 


F 


2.761.96 


Met 


345 




B 










0.79 


-0.40 


. * 


F 


1.82 1.01 


Arg 


346 




B 










0.69 


-0.50 


* * 


F 


1.48 137 


Pro 


347 




B 






T 




0.14 


-0.50 


* * 


F 


1.19 0.68 


Val 


348 




B 






T 




-0.32 


0.14 




F 


0.25 0.51 


Ser 


349 




B 






T 




-1.18 


0.21 


« • 


F 


0.25 0.21 


Gly 


350 




B 






T 




-0.92 


0.90 


* 




-0.20 0.11 


Val 


351 




B 


B 








-1.62 


0.90 


* 




-0.60 0.15 


Leu 


352 




B 


B 








-2.27 


0.76 






-0.60 0.11 


Leu 


353 




B 


B 








-1.76 


1.01 






-0.60 0.09 


Gly 


354 




B 


B 








-1.80 


1.01 


• • 




-0.60 0.11 


Ala 


355 




B 


B 








-2.04 


0.80 






-0.60 0.14 


Val 


356 




B 


B 








-1.53 


0.61 






-0.60 0.17 


Gly 


357 




B 










-1.31 


0.36 






-0.10 0.17 


Gly 


358 










T 


c 


-0.81 


0.43 




F 


0.15 0.17 


Ala 


359 




B 






T 




-1.06 


0.41 




F 


-0.05 033 


Gly 


360 




B 






T 




-1.28 


0.27 






0.10 033 


Ala 


361 




B 






T 




-1.28 


0.53 






-0.20 0.28 


Thr 


362 




B 


B 








-1.63 


0.74 






-0.60 0.20 


Ala 


363 




B 


B 








-2.10 


1.03 






-0.60 0.18 


Leu 


364 




B 


B 








-1.81 


1.29 






-0.60 0.15 
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Val 


365 




B 


B 






-2.17 


1.17 






-0.60 0.14 


Phe 


366 




B B 






-2.24 


1.47 






-0.60 0.12 


Leu 


367 




B B 






-2.79 


1.54 






-0.60 0.08 


Ser 


368 




B B 






-3.09 


1.50 






-0.60 0.08 


Phe 


369 




B B 






-2.98 


1.54 






-0.60 0.06 


Cys 


370 




B B 






-3.01 


134 






-0.60 0.06 


Val 


371 




B B 






-3.17 


1.54 






-0.60 0.03 


lie 


372 




B B 






-3.21 


1.80 






-0.60 0.03 


Phe 


373 




B B 






-2.80 


1.66 






-0.60 0.04 


lie 


374 




B B 






-2.40 


1.09 






-0.60 0.11 


Val 


375 




B B 






-2.40 


0.83 






-0.26 0.20 


Val 


376 




B B 






-1.43 


0.71 






0.08 0.12 


Arg 


377 




B 




T 




-0.50 


-0.07 






1.72 0.35 


Ser 


378 






T 


T 




0.24 


-0.76 




F 


2.91 0.94 


Cys 


379 






T 


T 




0.83 


-1.40 




F 


3.40 2.53 


Arg 


380 






T 


T 




1.10 


-1.66 




F 


3.06 1.73 


Lys 


381 


A 




T 






2.07 


-1.16 




F 


2.32 130 


Lys 


382 


A 




T 






1.74 


-1.54 




F 


1.98 4.76 


Ser 


383 


A 








C 


1.46 


-1.69 




F 


1.44 3.76 


Ala 


384 


A 


B 








1.53 


-1.19 


* * 


F 


0.90 1.90 


Arg 


385 


A 


B 








1.42 


-0.69 


• * 


F 


0.75 0.96 


Pro 


386 


A 


B 








0.52 


-0.69 


* * 


F 


0.90 1.20 


Ala 


387 


A 


B 








0.13 


-0.43 






0.30 0.88 


Ala 


388 


A 


B 








0.43 


-0.50 


* * 




0.30 0.44 


Asp 


389 




B 




T 




0.13 f 


-0.50 


• * 




0.70 0.48 


Val 


390 




B 




T 




-0.32 


-0.24 


* * 




0.70 0.33 


Gly 


391 




B 




T 




-0.71 


-0.31 




F 


0.85 033 


Asp 


392 


• 


B 




T 




-0.08 


-0.20 




F 


0.85 0.19 


He 


393 


A 


B 








0.51 


-0.20 




F 


0.45 0.52 


Gly 


394 


A 


B 








-0.08 


-0.84 




F 


0.75 0.88 


Met 


395 


A 


B 








0.78 


-0.77 




F 


1.01 0.53 


Lys 


396 


A 


B 








0.81 


-037 




F 


1.12 1.22 


Asp 


397 




B 




T 




-0.08 


-037 


* * 


F 


2.08 1.78 


Ala 


398 




B 




T 




0.92 


-031 


* * 


F 


2.04 1.26 


Asn 


399 




B 




T 




0.92 


-0.93 


* * 


F 


2.60 1.23 


Thr 


400 




B 




T 




1.22 


-030 


* * 


F 


L89 0.73 


lie 


401 




B 




T 




039 


-0.11 


* * 


F 


1.63 0.97 


Arg 


402 




B 




T 




0.29 


-0.11 




F 


1.61 0.61 


Gly 


403 




B 




T 




0.88 


-0.13 


* * 


F 


139 0.57 


Ser 


404 








T 


C 


0.53 


-0.21 


* * 


F 


1.92 1.40 


Ala 


405 










c 


0.84 


-0.47 


» * 


F 


1.81 0.71 
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Ser 


406 








T 


C 


0.92 


-0.07 


* * 


F 


2.40 1.15 




Gin 


407 








T 


C 


0.50 


0.19 


• 


F 


1.41 0.71 




Gly 


408 








T 


C 


0.84 


0.29 




F 


132 t.01 




Asn 


409 








T 


C 


0.84 


-0.21 




F 


1.68 1.30 


5 


Leu 


410 










C 


1.14 


-0.21 




F 


1.58 1.01 




Hit 


411 




B 




T 




0.86 


0.30 


* 


F 


1.08 1.07 




Glu 


412 




B 




T 




0.86 


037 


* 


F 


1.27 0.67 




Ser 


413 




B 




T 




120 


-0.03 


* 


F 


236 136 




Trp 


414 








T 




\20 


-0.71 


* 


F 


3.40 138 


10 


Ala 


415 










c 


1.80 


-0.80 


* * 


F 


2.661.46 




Asp 


416 












2.22 


-0.37 


* 


F 


2.56 1.69 




Asp 


417 










c 


2.19 


-0.76 


* * 


F 


2.66 3.15 




Asn 


418 








T 


c 


2.46 


-1.17 


* * 


F 


2.86 4.24 




Pro 


419 








T 


c 


2.40 


-1.17 




F 


2.86 3.45 


15 


Arg 


420 








T 




2.18 


-0.74 


* 


F 


3.40 2.05 




His 


421 








T 


c 


1.59 


-0.06 






2.41 1.05 




ffis 


422 


A 








c 


1.00 


0.04 






0.92 0.69 




Gly 


423 


A 


B 








0.97 


0.11 






038 0.35 




Leu 


424 


A 


B 








0.88 


0.61 






-0.26 0.35 


20 


Ala 


425 


A 


B 








0.47 


0.50 






-030 0.35 




Ala 


426 


A 








c 


0.16 


0.39 






0.50 0.47 




His 


427 








T 


c 


0.19 


039 






1.20 037 




Ser 


428 








T 


c 


0.53 


-0.30 




F 


2.25 0.97 




Ser 


429 








T 


c 


1.46 


-0.80 




F 


3.00 1.66 


25 


Gly 


430 








T 


c 


2.04 


-1.30 




F 


2.70 Z39 




Glu 


431 


A 








c 


1.74 


-1.80 




F 


2.00 3.09 




Glu 


432 


A 








c 


1.78 


-1.50 


* 


F 


1.70 1.62 




Arg 


433 


A 


B 








1.83 


-1.49 


* 


F 


1.20 2.83 




Glu 


434 


A 


B 








1.54 


-1.16 


* 


F 


0.90 2.56 


30 


He 


435 


A 


B 








1.68 


-0.66 






0.75 1.49 




Gin 


436 


A 


B 








0.87 


-0.23 


* 




0.45 1.18 




Tyr 


437 


A 


B 






. 


0.57 


0.46 


* 




-0.60 0.5.6 




Ala 


438 










c 


-0.24 


0.84 


* 




-0.05 1.07 




.Pro 


439 




B 








-0.28 


0.94 


* 




-0.40 0.54 


35 


Leu 


440 




B 








0.66 


1.04 


* 




-0.40 0.47 




Ser 


441 










c 


0.31 


0.29 


♦ 




0.10 0.92 




Phe 


442 










c 


0.56 


021 






0.10 0.59 




His 


443 






T 






0.93 


-0.21 






1.05 1.24 




Lys 


444 






T 






1.14 


-0.47 




F 


1.50 1.43 


40 


Gly 


445 










c 


1.96 


-0.46 




F 


1.60 2.86 




Glu 


446 










c 


1.44 


-1.24 


* 


F 


2.20 3.51 
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Pro 


447 




• 




C 


1.84 


•1.06 


* 




F 


2.50 1.45 


Gin 


448 




T 






1.53 


-0.67 






F 


3.00 1.96 


Asp 


449 








C 


1.49 


-0.67 






F 


2.50 1.12 


Leu 


450 






T 


C 


1.83 


-0.27 






F 


2.10 1.26 


Ser 


451 






T 


C 


1.24 


-0.70 






F 


2.10 1.26 


Gly 


452 


B 




T 




1.14 


-0.60 






F 


1.45 0.76 


Gin 


453 






T 


C 


1.14 


-0.11 






F 


1.20 133 


Glu 


454 








C 


1.14 


-0.40 






F 


1.001.60 


Ala 


455 








c 


1.96 


-0.39 






F 


1302.59 


Thr 


456 








c 


2.01 


-0.81 






F 


1.90 2.59 


Asn 


457 






T 


c 


2.06 


-0.46 






F 


2.10 235 


Asn 


458 






T 


c 


2.06 


-0.07 


* 




F 


2.40 3.11 


Glu 


459 






T 


c 


1.17 


-0.57 


* 




F 


3.00 3.73 


Tyr 


460 




T 


T 




1.80 


-037 




* 


F 


2.60 1.63 


Ser 


461 




T 






1.22 


-0.77 




* 


F 


2.40 Z02 


Glu 


462 


B 








1.01 


-0.49 






F 


1.25 0.82 


He 


463 


B 








1.06 


-0.06 






F 


1.22 0.81 


Lys 


464 


B 








0.67 


-0.81 






F 


1.64 \2l 


He 


465 


B 








0.52 


-0.77 




* 


F 


1.76 0.89 


Pro 


466 


B 








0.43 


-0.34 




* 




1.73 1.62 


Lys 


467 




T 






0.04 


-0.60 




* 




2.70 1.04 


Ter 


468 


B 








0.54 


-0.17 


* 






1.73 1.89 
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Table VII 



Res Position 


1 


ii in 


IV 


V 


VI 


VII 


VIII 


IX 


X 


XI 


XII 


xni XI 


Met 


1 




B 






• 




0.43 


0.01 


* 






-0.10 0.79 


Asp 


2 




B 










0.48 


-0.41 


* 






0.65 1.21 


Leu 


3 




fi 






T 




0.06 


-0.41 


* 






0.70 0.94 


Pro 


4 


A 


• 






T 




-0.41 


-0.16 


* 






0.70 0.78 


Arg 


5 




B 






T 




-0.88 


-0.13 


* 




F 


0.85 0.35 


Gly 


6 




B 






T 




-0.87 


0.51 


* 




F 


-0.05 031 


Leu 


7 




B 


B 








-1.16 


033 


* 






-0.30 0.20 


Val 


8 




B 


B 








-0.93 


0.81 


* 






-0.60 0.11 


Val 


9 




B 


B 








-1.53 


131 








-0.60 0.11 


Ala 


10 




B 


B 








-1.94 


1.57 








-0.60 0.11 


Trp 


11 




B 


B 








-2.41 


1.27 








-0.60 0.20 


Ala 


12 




B 


B 








-1.89 


131 








-0.60 0.22 


Leu 


13 




B 


B 








-1.24 


1.59 








-0.60 0.23 


Ser 


14 












C 


-0.73 


1.51 








-0.20 034 


Leu 


15 












C 


-0.84 


1.03 








-0.20 034 


Trp 


16 










T 


c 


-0.87 


1.31 


* 






0.00 0.35 


Pro 


17 










T 


c 


-0.28 


1.11 


* 






0.00 038 


Gly 


18 








T 


T 


• 


0.22 


0.73 


* 




F 


035 0.77 


Phe 


19 








T 


T 


• 


-0.18 


0.53 


* 




F 


0.50 1.06 


Thr 


20 












c 


0.63 


0.40 


* 




F 


0.25 0.59 


Asp 


21 












c 


032 


037 




* 


F 


0.25 0.96 


Thr 


22 










- 


c 


0.53 


0.56 


* 


* 


- 


0.29 1.10 


Phe 


23 




B 










0.57 


-0.23 


* 


* 




1.33 1.27 


Asn 


24 




B 






T 




1.38 


-0.23 


• 


* 




1.87 1.10 


Met 


25 








T 


T 




1.73 


-0.23 


* 


* 




2.61 1.49 


Asp 


26 








T 


T 




1.52 


-0.71 


* 




F 


3.40 3.44 


Thr 


27 








T 


T 




1.94 


-1.07 


* 


* 


F 


3.06 331 


Arg 


28 








T 






1.79 


-1.47 


• 


* 


F 


2.52 635 


Lys 


29 




B 










0.90 


-1.44 


* 




F 


1.78 2.91 


Pro 


30 




B 


B 








U9 


-0.76 


* 




F 


1.24 1.41 


Arg 


31 




B 


B 








0.94 


-0.81 






F 


0.90 1.12 


Val 


32 




B 


B 








0.96 


-0.39 




* 




0.30 0.55 


lie 


33 




B 






T 




0.96 


0.00 




* 


F 


0.22 0.48 


Pro 


34 




B 






T 




0,60 


-0.43 




* 


F 


0.79 0.48 


Gly 


35 








T 


T 




0.22 


0.06 


* 


* 


F 


0.56 0.93 


Ser 


36 




B 






T 




-0.59 


-0.09 


* 


* 


F 


0.88 1.34 
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442 






B 


B 




-0.42 


0.47 


* 


• 




-0.60 0.20 




Ala 


443 






B 


B 




0.03 


0.90 


* 


* 




-0.60 0.42 




Gly 


, 444 






B 


B 




-0.78 


0.26 


* 


* 




-030 0.52 


40 


Ala 


445 






B 






0.08 


0.29 




* 




-0.10 0.61 




Pro 


446 












C 0.29 


0.04 


m 


* 


F 


0.25 0.97 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



Arg 


447 






B 




Phe 


448 






B 




Asa 


449 










His 


450 






• 




Thr 


451 










Gly 


452 






■ 




Lys 


453 






B 




Val 


454 






B 


B 


lie 


455 






B 


B 


Leu 


456 






B 


B 


Phe 


457 






B 


B 


Thr 


458 






B 


B 


Met 


459 






B 


B 


His 


460 










Asn 


461 










Asn 


462 










Arg 


463 










Ser 


464 










Leu 


465 


A 


A 






Thr 


466 




A 


B 




Ile 


467 




A 


B 




His 


468 




A 


B 




Gin 


469 




A 


B 




Ala 


470 




A 


B 




Met 


471 






B 




Arg 


472 






B 




Gly 


473 






B 




Gin 


474 






B 




Gin 


475 






B 




lie 


476 






B 




Gly 


477 






B 




Ser 


478 






B 




Tyr 


479 






• 




Phe 


480 






B 




Gly 


481 








B 


Ser 


482 








B 


Glu 


483 






B 


B 


He 


484 






B 


B 


Thr 


485 






B 


B 


Ser 


486 






B 


B 


Val 


487 






B 


B 



T 



PCT/US99/2503I 

284 



• 




0.83 


0.04 


* 


* 


F 


020 133 


• 




1.08 


0.10 


* 


* 


. 


0.18 1.90 


* 




1.47 


0.03 


* 


• 


. 


0.71 122 


T 




1.20 


-0.40 


* 


* 


F 


1.79 124 


T 


C 


0.52 


0.24 


* 


* 


F 


1.12 1.06 


T 




-0.40 


0.14 


* 


* 


F 


130 0.46 


T 




-0.40 


0.43 


* 


* 


F 


0.47 028 






-0.71 


0.71 








-021 0.17 






-1.28 


0.71 








-0.34 025 






-1.00 


0.90 








-0.47 0.12 






-0.66 


1.40 








-0.60 022 






-0.70 


1.16 








-0.60 0.51 






027 


0.87 








-032 1.00 


T 


C 


0.86 


0.19 








1.01 226 


T 


c 


0.86 


-0.21 






F 


2.04 2.10 


T 


c 


124 


-0.01 






F 


232 1.75 


T 




0.67 


-0.14 






F 


2.80 1.85 




c 


123 


0.04 






F 


137 0.81 






127 


0.14 








0.54 0.68 






0.68 


0.14 








026 0.60 






0.08 


0.64 




* 




-032 0.46 






0.08 


0.87 




* 




-0.60 0.55 






0.03 


0.19 


* 


* 




-030 0.74 






0.84 


0.13 


* 


* 




0.02 1.05 


T 




1.16 


-0.16 


♦ 


t 




1.19 133 


T 




1.16 


-0.26 


* 


* 


F 


1.51 1.33 


T 




0.84 


0.03 


* 




F 


0.93 0.92 


T 




034 


-0.04 






F 


1.700.92 


- 




0.89 


-027 


* 




F 


133 0.63 


• 




0.79 


0.49 


* 




F 


0.41 1.00 


T 




033 


0.84 


* 




F 


029 0.50 


T 




0.38 


0.87 


♦ 




F 


0.12 029 


T 


c 


0.38 


0.86 


* 






0.00 035 


T 




-0.51 


0.17 






F 


025 0.96 




c 


0.07 


0.43 






F 


-025 0.50 




c 


0.11 


0.53 






F 


-025 0.46 






-0.44 


0.16 






F 


-0.15 0.71 






-020 


0.01 






F 


-0.15 0.54 






-039 


-0.41 




* 


F 


0.45 0.67 






-0.04 


-0.11 




* 


F 


0.45 0.27 






-0.09 


-o.n 




* 


F 


0.76 0.64 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



285 





Asp 


488 




B 


B 


• 




-0.09 


-037 


• 




F 


1.07 0.44 




lie 


489 




B 




• 




0.46 


-0.86 


• 




F 


1.88 0.55 




' Asp 


490 




■ • 




T 


T 


. -0.09 


-0.81 


• 




F 


2.79 0.73 




Gly 


491 




• 




T 


T 


-0.10 


4>.81 


• 




F 


3.10 033 


5 


Asp 


492 




* 




T 


T 


. 0.76 


-033 


* 




F 


2.490.67 




Gly 


493 




B 






T 


-OAQ 


-1.01 


* 




F 


2.08 0.67 




Val 


494 




B 


B 






-0.02 


-037 






F 


1.07 0.50 




Hit 


495 




B 


B 






. -0.83 


-0.11 






F 


0.76 0.25 




Asp 


496 




B 


B 






-134 


0.57 


* 




F 


-0.45 0.21 


10 


Val 


497 




B 


B 






. -1.69 


0.79 


* 






-0.60 02\ 




Leu 


498 




B 


B 






-1.93 


0.57 








-0.60 0.14 




Leu 


499 




B 


B 






-1.29 


0.59 








-0.60 0.09 




Val 


500 




B 


B 






-1.58 


1.01 








-0.60 0.18 




Gly 


501 




B 


B 






-1.82 


0.99 








-0.60 02\ 


15 


Ala 


502 




B 








. -1.67 


1.06 




• 




-0.40 0.41 




Pro 


503 




B 








. -0.86 


1.16 




- 




-0.40 0.48 




Met 


504 




B 








-0.04 


0.91 








-0.40 0.77 




Tyr 


505 




B 








0.47 


0.49 








-0.25 133 




Phe 


506 




B 








0.92 


0.41 








-0.40 0.85 


20 


Asn 


507 


A 








T 


. 1.51 


-0.01 




* 




0.85 1.68 




Glu 


508 


A 








T 


. 1.83 


-0.63 




* 


F 


130 1.86 




Gly 


509 


A 








T 


. 2.09 


-1.39 




* 


F 


130420 




Arg 


510 


A 








T 


. 2.38 


-1.74 




* 


F 


130 238 




Glu 


511 


A 










. 222 


-2.14 




* 


F 


1.10 2.98 


25 


Arg 


512 






B 


T 




1.98 


-1.50 




* 


F 


130 224 




Gly 


513 






B 


T 




. 1.12 


-1.17 






F 


1.30 1.79 




Lys 


514 




B 


B 






. 1.22 


-0.53 




* 


F 


0.75 0.77 




Val 


515 




B 


B 






. 1.11 


0.23 




* 




-030 0.61 




Tyr 


516 




B 


B 






0.30 


023 




• 




-0.15 1.07 


30 


Val 


517 




B 


B 






030 


0.49 




* 




-0.60 0.44 




Tyr 


518 




B 








0.64 


0.49 


* 


* 




0.01 1.17 




Glu 


519 




B 








0.60 


0.24 


* 


* 




0.57 129 : 




Leu 


520 




B 








1.57 


-0.11 








1.43 2.80 




Arg 


521 


A 








T 


1.11 


-0.76 






F 


2.34 3.50 


35 


Gin 


522 




B 






T 


1.11 


-0.73 






F 


2.60 1.75 




Asn 


523 




B 






T 


1.11 


-0.09 






F 


2.04 1.57 




Arg 


524 




B 






T 


1.11 


-0.01 


* 


* 


F 


1.78 126 




Phe 


525 




B 








. 1.58 


039 


* 






0.57 1.17 




Val 


526 




B 






T 


1.16 


0.41 


* 


* 




0.06 0.72 


40 


Tyr 


527 




B 






T 


0.34 


030 








-0.20 0.53 




Asn 


528 




B 






T 


039 


1.19 




* 


F 


0.29 0.50 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 





Gly 


529 




• 




T 


T 


* 


0.28 


0.40 


* 


* 


F 


1.18 1.36 




Thr 


530 




• 








C 


0.68 


-0.24 




• 


F 


Z02 1.45 




Leu 


531 




• * 






T 


C 


1.50 


-0.61 




• 


F 


2.86 1.21 




Lys 


532 




• 






T 




1.44 


-OJt 




* 


F 


3.40 1.66 


5 


Asp 


533 




B 






T 




1.20 


-0.56 






F 


2.661.54 




Ser 


534 




B 






T 




1.54 


-0.29 






F 


2.02 2.93 




His 


535 




B 






T 




1.86 


-0.57 








1.83 2.54 




Ser 


536 




B 






T 




2.08 


-0.17 




* 




1.19 2.44 




Tyr 


537 




B 






T 




2.14 


0.33 








0.251.84 


10 


Gin 


538 




B 






T 




1.44 


-0.06 




* 




0.85 2.65 




Asn 


539 




B 










1.40 


0.23 




* 




0.05 1.71 




Ala 


540 




B 






• 




1.13 


0.27 




* 




0.05 1.08 




Arg 


541 




B 










1.13 


-0.10 


* 


* 




0.500.84 




Phe 


542 




B 










0.49 


-0.11 


* 


* 


F 


0.65 0.70 


15 


Gly 


543 




- 




T 


T 




-0.10 


0.17 


* 


* 


F 


0.65 0.48 




Ser 


544 




. 






T 




-0.40 


0.17 


* 


« 


F 


0.45 0.25 




Ser 


545 




B 






T 




-0.67 


0.56 




« 


F 


-0.05 0.39 




lie 


546 




B 






T 




-0.67 


0.41 


* 


* 




-0.20 029 




Ala 


547 




B 


B 








0.03 


-0.01 


* 




. 


030 0.42 


20 


Ser 


548 




B 


B 








-0.43 


-0.40 






. 


0.30 0.53 




Val 


549 




B 


B 








-0.13 


-0.10 


* 






0.64 0.62 




Arg 


550 




B 


B 








0.17 


-039 


* 




F 


1.13 0.99 




Asp 


551 




B 










1.06 


-0.49 


* 




F 


1.82 1.28 




Leu 


552 




B 










1.34 


-0.87 


* 




F 


2.46 2.88 


25 


Asn 


553 








T 


T 




1.40 


-1.13 


* 




F 


3.40 1.97 




Gin 


554 








T 


T 




2.26 


-037 


* 




F 


2.76 1.85 




Asp 


555 








T 


T 




Z14 


0.03 






F 


1.82 3.60 




Ser 


556 








T 


T 




1.29 


-0.66 


* 




F 


238 3.74 




Tyr 


557 






B 


T 






1.24 


-0.41 






F 


1341.60 


30 


Asn 


558 




B 


B 








039 


-0.17 






F 


0.45 0.71 




Asp 


559 




B 


B 








0.04 


0.47 








-0.60 039 




Val 


560 




B 


B 








-0.54 


Oil 








-0.60 0.25 




Val 


561 




B 


B 








-0.46 


0.26 








-0.30 0.16 




Val 


562 




B 


B 








-1.02 


029 








-030 0.14 


35 


Gly 


563 




B 


B 








-1.02 


0.97 








-0.60 0.16 




Ala 


564 


A 


B 










-1.02 


033 








-0.30 038 




Pro 


565 A A 












-0.17 


-0.31 








030 0.85 




Leu 


566 i 


\ A 












0.66 


-0.56 








0.75 137 




Glu 


567 i 


\ A 












0.92 


-0.49 






F 


0.60 1.85 


40 


Asp 


568 


\ A 












0.92 


-0.49 








0.45 1.21 




Asn 


569 


K A 












0.92 


-0.49 








0.45 1.45 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



287 



His 


570 


A 


A 












0.24 


-0.67 








0.600.85 


Ala 


571 


A 






B 








0.81 


0.01 








-0.30 036 


Gly 


572 


A 






B 








-0.08 


0.77 








-0.60 035 


Ala 


573 


A 






B 








-0.78 


1.06 








-0.60 0.18 


He 


574 






B 


B 








-0.81 


1.34 








-0.60 0.15 


Tyr 


575 






B 


B 








-1.12 


134 








-0.60 0.21 


He 


576 






B 


B 








-1.23 


134 


« 






-0.60 0.21 


Phe 


577 






B 


B 








-0.78 


1.63 


* 


* 




-0.60 0.25 


His 


578 






B 


B 








-0.53 


0.94 


* 


* 




-0.60 032 


Gly 


579 








B 


T 






0.06 


0.61 


* 


* 




-010 0.45 


Phe 


580 










T 


T 




-0.59 


031 




* 




0.50 0.69 


Arg 


581 










T 


T 




-031 


0.21 


* 


* 


F 


0.65 036 


Gly 


582 










T 


T 




013 


0.40 






F 


035 030 


Ser 


583 










T 


T 




-0.04 


-0.03 






F 


115 0.69 


He 


.584 












. 


C 


0.09 


-033 


* 


* 


F 


1.15 031 


Leu 


585 














C 


0.83 


0.10 


* 


* 


F 


0.85 0.79 


Lys 


586 












. 


C 


0.72 


-033 






F 


1.90 1.18 


Thr 


587 












T 


c 


1.18 


-031 


* 


* 


F 


2.40 2.92 


Pro 


588 












T 


c 


0.59 


-1.00 


* 


* 


F 


3.00 6.94 


Lys 


589 






B 






T 




1.17 


-1.00 


* 


* 


F 


2.50 2.43 


Gin 


590 






B 






T 




139 


-031 


* 


* 


F 


Z20 2.43 


Arg 


591 






B 


B 








1.04 


-030 




* 


F 


1.20 139 


He 


592 






B 


B 








1.36 


-0.54 




* 


F 


110 1.06 


Thr 


593 




A 


B 


B 








0.76 


-0.54 




* 


F 


0.90 1.06 


Ala 


594 




A 


B 


B 








0.12 


-016 




* 


F 


0.45 0.45 


Ser 


595 




A 


B 


. 








-0.19 


0.24 






F 


-0.15 0.65 


Ghi 


596 




A 


B 










-0.64 


0.04 




* 


F 


-0.15 0.65 


Leu 


597 


A 


A 




B 








-037 


-0.01 






F 


0.45 0.63 


Ala 


598 


A 


A 


. 


B 








-016 


0.17 


* 






-030 039 


Thr 


599 


A 


A 




B 








0.09 


0.19 


* 






-030 039 


Gly 


600 


A 


A 




B 








-0.31 


0.94 


* 






-0.60 0.74 


Leu 


601 






B 


B 








-0.66 


1.04 


* 






-0.60 0.63 


Gin 


602 






B 


B 








-0.51 


0.97 


* 






-0.60 0.44 


Tyr 


603 










T 


T 




-0.22 


1.06 


* 






010 014 


Phe 


604 






B 






T 




-0.80 


1.01 


* 






.0.20 0.38 


Gly 


605 






B 






T 




-049 


1.01 


* 






-010 0.15 


Cys 


606 






B 






T 




-0.02 


1.11 




* 




-010 0.13 


Ser 


607 






B 


B 








-0.02 


0.79 




* 




-0.60 0.15 


He 


608 






B 


B 








-0.59 


0.40 








-0.60 017 


His 


609 






B 


B 








0.11 


0.66 




* 




-0.60 0.41 


Gly 


610 






B 


B 








-0.36 


0.09 




* 




-0.30 032 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCT/US99/25031 



288 



Gin 


611 


• 




B 








. 031 


0.39 






-0.10 0.61 


Leu 


612 


• • 




B 






• 


0.61 


0.10 


. * 




-0.10 0.72 


Asp 


613 


• 




B 








. 150 


-0.40 


. * 




0.65 1.26 


Leu 


614 






B 








1.19 


-0.83 


. * 


• 


0.95 1.21 


Asn 


615 ' 


A 




• 






T 


. 0.72 


-0.80 


. * 


F 


1.30 1.45 


Glu 


616 


A 




• 






T 


. -0.17 


-0.80 


• * 


F 


1.15 0.72 


Asp 


617 


A 




• 






T 


. 0.64 


-0.11 


. * 


F 


0.85 0.61 


Gly 


618 


A 




• 






T 


. -0.17 


-0.80 




F 


1.15 0.63 


Leu 


619 


A 




• 


B 






0.06 


-0.51 






0.60 030 


He 


620 


A 




* 


B 






-0.80 


-0.01 






030 0.18 


Asp 


621 


* 




B 


B 






. -1.14 


0.63 






-0.60 0.14 


Leu 


622 






B 


B 






. -1.73 


0.63 


* * 




-0.60 0,16 


Ala 


623 


• 




B 


B 






. -2.20 


0.44 






-0.60 0.24 


Val 


624 


A 






B 






. -1.73 


0.44 


* 




-0.60 0.12 


Gly 


625 


A 


A 




• 






-0.84 


0.87 


* * 




-0.60 0.14 


Ala 


626 


A 


A 




• 






. -1.43 


059 






-0.60 0.22 


Leu 


627 


A 


A 




- 






-1.48 


059 






-0.60 0.30 


Gly 


628 


A 


A 




. 






. -1.78 


059 






-0.60 0.23 


Asn 


629 




A 


B 


B 






. -1.73 


0.84 






-0.60 0.16 


Ala 


630 




A 


B 


B 






. -1.68 


1.03 






-0.60 0.16 


Val 


631 




A 


B 


B 






. -139 


1.26 


* 




-0.60 0.17 


lie 


632 




A 


B 


B 






. -0.47 


1.21 


* 




-0.60 0.14 


Leu 


633 




A 


B 


B 






. -033 


0.81 


* 




-0.60 0.27 


Trp 


634 




A 


B 


B 






. -1.19 


0.74 


* 




-0.60 057 


Ser 


635 




A 


B 


B 






. -1.46 


0.74 


• 


. 


-0.60 0.60 


Arg 


636 






B 


B 






. -0.60 


0.70 


* * 


F 


-0.45 054 


Pro 


637 






B 


B 






. -0.60 


0.41 


* * 
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He 
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0.90 0.86 
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0.63 


-0.56 
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1.95 0.86 
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272 
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F 
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F 
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-0.37 
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279 
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0.94 
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B 
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0.94 
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-0.60 0.28 
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Val 
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B 


B 
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B 
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0.44 
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0.80 
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1.09 
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1.21 


. 








1.21 


B 








0.40 


B 








0.10 


B 








-0.24 
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0.14 


B 








036 


B 






C 


-0.46 






T 


c 
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T 


c 
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T 


c 


-0.57 






T 




0.24 










-0.27 










0.08 










033 










0.02 








c 


0.83 




T 






1.53 




T 






Z08 




T 


T 
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T 


T 
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T 
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T 
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1.46 
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-0.03 






T 




-0.10 






T 




-0.02 










-0.06 










0.23 










0.53 










0.29 






T 




0.58 



-0.80 


* 


* 




0.75 1.19 


-1.10 


* 


* 


F 


0.90 1.59 


-131 


* 


* 


F 


1.30 1.26 


-0.91 


* 


* 


F 


0.90 2.93 


-1.06 


* 


* 


F 


0.90 239 


-0.56 






F 


0.90 1.45 


0.09 








-030 0.62 


0.77 








-0.60 0.52 


0.54 








-0.60 0.45 


0.54 








-0.60 0.26 


057 








-0.60 0.29 


0.47 


* 




F 


-0.25 0.46 


0.66 


« 




F 


0.15 0.51 


0.80 


* 




F 


0.15 0.54 


0.80 






F 


0.15 0.28 


0.81 








-0.20 0.46 


0.79 


♦ 






-0.40 0.48 


1.04 


* 






-0.40 0.24 


1.04 








-0.40 0.48 
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039 1.10 
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F 


1.08 221 


-0.63 






F 


2SI 236 


-0.63 






F 
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-0.60 






F 


3.40 2.83 


-0.80 




* 


F 


3.06 Z54 


-0.44 




* 


F 


2.42 1.99 


-0.44 


♦ 


* 


F 


2.08 2.70 


-036 




* 


F 


0.99 0.87 


-036 




* 




0.500.93 


-0.26 




* 




0.50 0.95 


-030 




* 




0.50 0.88 


-0.39 




* 




0.500.91 


-0.46 




* 




0.70 032 


0.19 




* 




0.10 0.45 


0.04 




* 




0.100.26 


0.04 


• 


* 




0.10 0.51 


0.04 


* 






0.10 038 


0.16 


* 


* 




030 0.32 


0.14 


• 


* 




0.50 0.62 


-0.54 


* 


* 


F 


1.90 1.40 


-0.43 




* 


F 


2.00 221 
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324 
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T 
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-0.24 




* 




1.65 1.09 


Asp 


325 


B 






T 




0.72 


0.44 
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0.400.89 


Tyr 


326 


B 


• 




T 




-0.18 


0.70 
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0.35 1.03 


Met 


327 


B 


B 








0.17 


0.84 








-0.40 051 


Leu 


328 


B 


B 








0.20 


121 








-0.60 0.53 


Tyr 


329 


B 


B 








0.29 


121 








-0.60 0.57 


Val 


330 


B 


B 








0.08 


0.89 








-0.60 0.89 


Tyr 


331 


B 










0.01 


0.70 








•025 1.67 


Asp 


332 


B 










0.30 


050 


* 




F 


-0.10 1.54 


Pro 


333 


B 






T 




0.22 


023 






F 


0.40 2.99 


Pro 


334 






T 


T 




0.26 


0.27 


* 




F 


0.80 1 .34 


Thr 


335 






T 


T 




0.90 


-0.06 


* 




F 


1.40 1.24 


Thr 


336 
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T 




0.93 


037 






F 


0.40 1.24 


He 


337 


B 










0.62 


0.37 






F 


0.32 1.24 


Pro 


338 


B 










0.52 


0.43 






F 


0.14 1.24 


Pro 


339 








T 


C 


0.42 


0.43 






F 


0.66 1.24 


Pro 


340 








T 


C 


0.42 


0.43 






F 


0.78 255 


Thr 


341 


• 






T 


C 


0.42 


0.23 






F 


1.202.38 


Thr 


342 


B 






T 




1.00 


0.29 






F 


0.88 2.22 


Thr 


343 


B 


B 








0.90 


034 






F 


0.36 2.07 


Thr 


344 


B 


B 








0.80 


0.40 






F 


-0.06 2.07 


Hit 


345 


B 


B 








0.70 


0.40 






F 


-0.18 2.07 


Thr 


346 


~B 


B 








0.70 


0.40 






F 


-0.30 2.07 


Thr 


347 


B 


B 








0.70 


0.40 






F 


-0.30 2.07 


Thr 


348 


B 


B 








0.70 


0.40 






F 


-0.30 2.07 


Thr 


349 


B 


B 








0.70 


0.40 






F 


-0.30 2.07 


Thr 


350 


B 


B 








0.12 


0.40 






F 


-0.30 2.07 


Thr 


351 


B 


B 








-0.38 


0.60 






F 


-0.30 1.01 


Thr 


352 


B 


B 








-0.38 


0.80 






F 


-0.45 057 


Thr 


353 


B 


B 








-0.96 


0.80 


* 




F 


-0.45 0.57 


lie 


354 


B 


B 








-1.53 


1.00 


* 






-0.60 0.28 


Leu 


355 


B 


B 








-1.53 


no 


* 






-0.60 0.14 


Thr 


356 


B 


B 








-1.22 


120 


* 






-0.60 0.14 


lie 


357 


B 


B 








-1.21 


0.71 


* 


* 




-0.60 0.32 


lie 


358 


B 


B 








-0.79 


0.41 




* 




-0.26 0.53 


Thr 


359 


B 


B 








-0.49 


-0.27 




* 


F 


1.13 0.71 


Asp 


360 


B 






T 




0.43 


-0.26 




* 


F 


2.02 1.03 


Ser 


361 








T 


C 


0.36 


-0.94 




* 


F 


2.86 2.87 


Arg 


362 






T 


T 




0.86 


-1.20 




* 


F 


3.40 2.54 


Ala 


363 






T 


T 




1.36 


-1.26 




* 




2.91 1.95 


Arg 


364 






T 






1.28 


-0.83 




* 




2.37 1.86 
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Table XI 
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15 



20 



25 
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40 
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1 


11 


m 


IV 


V 


VI 


VII 


VUI 


IX 


X 


XI 


XII 


XIII 


XIV 


Met 


1 


A 














0.14 


0.06 








-0.10 


0.89 


Ser 


2 


A 










T 




-0.28 


0.01 








0.10 0.94 


Ser 


3 


A 










T 




0.16 


0.27 








0.10 0.60 


Ser 


4 


A 










T 




0.51 


-0.16 


* 






0.85 1.22 


Ser 


5 


A 










T 




0.09 


-0.27 


* 




F 


1.00124 


Leu 


6 


. A 


A 












-0.12 


0.03 


* 




F 


-0.15 


0.76 


Lys 


7 


A 


A 












-0.49 


033 


* 




F 


-0.15 


0.47 


His 


8 


A 


A 












-0.79 


0.51 








-0.60 


0.19 


Leu 


9 


A 


A 












-1.08 


0.74 


* 






-0.60 


0.23 


Leu 


10 


A 


A 












-1.59 


0.56 


* 


* 




-0.60 


0.11 


Cys 


11 


A 


A 












-1.08 


1.24 


* 


* 




-0.60 


0.07 


Met 


12 


A 


A 












-1.41 


1.13 


* 






-0.60 


0.11 


Ala 


13 


A 


A 












-Z08 


136 


* 






-0.60 


0.14 


Leu 


14 


A 


A 












-1.57 


1.46 


* 






-0.60 


023 


Ser 


15 


A 






B 








-1.06 


127 


* 






-0.60 


0.31 


Trp 


16 


A 






B 








-1.09 


1.04 


* 






-0.60 


0.41 


Phe 


17 


A 






B 








-1.38 


133 








-0.60 


0.43 


Ser 


18 








B 






C 


-1.09 


133 


* 






-0.40 


0.23 


Ser 


19 








B 






C 


-0.62 


133 


* 






-0.40 


029 


Phe 


20 








B 






c 


-032 


0.84 








-0.40 


033 


He 


21 








B 






c 


-034 


0.06 








-0.10 


0.43 


Ser 


22 








B 






c 


0.06 


0.16 


* 


* 


F 


0.05 0.46 


Gly 


23 














c 


-0.34 


0.16 




* 


F 


0.25 0.71 


Glu 


24 














c 


-034 


0.16 




* 


F 


0.25 0.88 


Thr 


25 














c 


-0.46 


-0.14 




* 


F 


0.85 0.88 


Ser 


26 














c 


-038 


0.16 




* 


F 


0.25 0.73 


Phe 


27 


A 














-0.08 


0.41 


* 


• 




-0.40 


035 


Ser 


28 


A 














-0.03 


0.81 




* 




-0.40 


039 


Leu 


29 














c 


-0.73 


0.71 


* 






-0.20 


039 


Leu 


30 














c 


-1.12 


1.11 








-0.20 


039 


Asn 


31 










T 


T 




-1.63 


1.11 








0.20 025 


Ser 


32 










T 


T 




-1.14 


1.41 


* 






0.20 0.25 


Phe 


33 










T 


T 




-1.09 


i.16 








0.20 0.47 


Phe 


34 






B 






T 




-0.49 


123 








-0.20 


0.46 


Leu 


35 














c 


0.02 


1.26 








-0.20 


0.53 


Pro 


36 










T 






-0.28 


1.26 








0.00 0.82 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



PCTAJS99/25031 



332 



Tvr 
iyr 


17 






T 
I 


T 




0 11 
v. 1 J 


V.OO 




* 


r 


u.o/ 


riu 


1R 






T 
1 


T 




ft n 
U.I / 


ft ftTT 






T? 

r 


1.14 J.Vl 




Jy 






T 
i 


T 
1 




U.ZU 


A (\A 

-U.U4 






r 


1.91 1.05 


ocr 








T 


*r 
i 




U.j4 


ft 1ft 
U.1U 






F 
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F 
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Cys 
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B 
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-0.20 


0.27 








0.78 0.08 


Cys 


43 




B 


T 






-0.84 


0.27 








0.61 0.08 


Cys 


44 




B 


T 






-0.54 


0.53 








0.14 0.03 
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45 




B 


T 






-0.91 


0.93 








-0.03 0.10 
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46 




B 
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-1.32 


0.93 








-0.20 0.10 
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47 




B 


T 






-1.54 


0.74 








-0.20 0O5 


Gin 


48 




B 


T 






-1.69 


0.86 








-0.20 0.20 


Cys 


49 




B 


T 






-1.02 


0.76 








-0.20 0.12 


2»er 


50 




B 


T 






-0.53 


037 








0.10 0.28 


He 


51 




B 


T 






-0.93 


0.16 








0.10 0.25 


Leu 


52 




B 


T 






-038 


0.54 








•0.20 0.40 


Asp 


53 








T 


C 


-1.04 


036 


• 


• 




030 0.40 


Pro 


54 






T 


T 




-038 


034 


# 
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035 030 


Phe 


55 






T 


T 




-0.38 


0.26 








030 039 


Ser 


56 






T 


T 
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-0.04 
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1.10 0.48 


Cys 


57 






T 


T 




0.83 


0.57 


* 






0.20 030 


Asn 


58 






T 


T 




0.13 


0.14 


* 
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0.50 0.69 


Ser 


59 
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T 
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0.14 
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030 0.44 


Met 


60 
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T 
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0.86 1.28 


Arg 


61 
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0.84 
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0.22 0.84 


Phe 
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0.88 1.08 
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63 
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* 
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Trp 


64 
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Glu 
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T 


T 




034 
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Cys 


29 










T 


T 




-0.03 
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1.72 0.21 


Cys 
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B 








-0.89 


037 








-0.29 0.10 


Ser 
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B 








-1.10 


0.79 
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Pro 
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He 
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Table XIII summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled from 
partially homologous ("overlapping") sequences obtained from the "cDNA clone ID" 
identified in Table XIII and, in some cases, from additional related DNA clones. The 
overlapping sequences were assembled into a single contiguous sequence of high 
redundancy (usually three to five overlapping sequences at each nucleotide position), 
resulting in a final sequence identified as SEQ ID NO:X. 

The cDNA Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain 
multiple different clones corresponding to the same gene. "Vector" refers to the type of 
vector contained in the cDNA Clone ID. 

'Total NT Seq." refers to the total number of nucleotides in the contig identified by 
"Gene No." The deposited clone may contain all or most of these sequences, reflected by 
the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT of Clone Seq." 
of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the putative start codon 
(methionine) is identified as "5' NT of Start Codon." Similarly , the nucleotide position of 
SEQ ID NO:X of the predicted signal sequence is identified as "5' NT of First AA of 
Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is identified as 
"AA SEQ ID NO: Y," although other reading frames can also be easily translated using 
known molecular biology techniques. The polypeptides produced by these alternative 
open reading frames are specifically contemplated by the present invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
peptide is identified as "First A A of Sig Pep" and "Last AA of Sig Pep." The predicted 
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as 
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID 
NO: Y of the last amino acid in the open reading frame is identified as "Last AA of ORF." 
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SEQ ID NO:X (where X may be any of the polynucleotide sequences disclosed in 
the sequence listing) and the translated SEQ ID NO:Y (where Y may be any of the 
polypeptide sequences disclosed in the sequence listing) are sufficiently accurate and 
otherwise suitable for a variety of uses well known in the art and described further below. 
For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization probes that 
will detect nucleic acid sequences contained in SEQ ID NO.X or the cDNA contained in 
the deposited clone. These probes will also hybridize to nucleic acid molecules in 
biological samples, thereby enabling a variety of forensic and diagnostic methods of the 
invention. Similarly, polypeptides identified from SEQ ID NO:Y may be used, for 
example, to generate antibodies which bind specifically to proteins containing the 
polypeptides and the secreted proteins encoded by the cDNA clones identified in Table 
XIII. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 
sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion in 
an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide sequence 
or the amino acid sequence, the present invention provides not only the generated 
nucleotide sequence identified as SEQ ID NO:X and the predicted translated amino acid 
sequence identified as SEQ ID NO: Y, but also a sample of plasmid DNA containing a 
human cDNA of the invention deposited with the ATCC, as set forth in Table XIII. The 
nucleotide sequence of each deposited clone can readily be determined by sequencing the 
deposited clone in accordance with known methods. The predicted amino acid sequence 
can then be verified from such deposits. Moreover, the amino acid sequence of the protein 
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encoded by a particular clone can also be directly determined by peptide sequencing or by 
expressing the protein in a suitable host cell containing the deposited human cDNA, 
collecting the protein, and determining its sequence. 

The present invention also relates to the genes corresponding to SEQ ID NO:X, 
SEQ ID NO:Y, or the deposited clone. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. Such 
methods include preparing probes or primers from the disclosed sequence and identifying 
or amplifying the corresponding gene from appropriate sources of genomic material. 

Also provided in the present invention are allelic variants, orthologs, and/or species 
homologs. Procedures known in the art can be used to obtain full-length genes, allelic 
variants, splice variants, full-length coding portions, orthologs, and/or species homologs of 
genes corresponding to SEQ ID NO:X, SEQ ID NO:Y, or a deposited clone, using 
information from the sequences disclosed herein or the clones deposited with the ATCC. 
For example, allelic variants and/or species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source for allelic variants and/or the desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
in the art. 

The polypeptides may be in the form of the secreted protein, including the mature 
form, or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification , such as multiple 
histidine residues, or an additional sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated 
form, and preferably are substantially purified. A recombinantly produced version of a 
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polypeptide, including the secreted polypeptide, can be substantially purified using 
techniques described herein or otherwise known in the art, such as, for example, by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). Polypeptides of 
the invention also can be purified from natural, synthetic or recombinant sources using 
techniques described herein or otherwise known in the art, such as, for example, antibodies 
of the invention raised against the secreted protein. 

The present invention provides a polynucleotide comprising, or alternatively 
consisting of, the nucleic acid sequence of SEQ ID NO:X, and/or a cDNA contained in 
ATCC deposit Z. The present invention also provides a polypeptide comprising, or 
alternatively, consisting of, the polypeptide sequence of SEQ ID NO: Y and/or a 
polypeptide encoded by the cDNA contained in ATCC deposit Z. Polynucleotides 
encoding a polypeptide comprising, or alternatively consisting of the polypeptide sequence 
of SEQ ID NO: Y and/or a polypeptide sequence encoded by the cDNA contained in 
ATCC deposit Z are also encompassed by the invention. 
Signal Sequences 

The present invention also encompasses mature forms of the polypeptide having 
the polypeptide sequence of SEQ ID NO:Y and/or the polypeptide sequence encoded by 
the cDNA in a deposited clone. Polynucleotides encoding the mature forms (such as, for 
example, the polynucleotide sequence in SEQ ID NO:X and/or the polynucleotide 
sequence contained in the cDNA of a deposited clone) are also encompassed by the 
invention. According to the signal hypothesis, proteins secreted by mammalian cells have 
a signal or secretary leader sequence which is cleaved from the mature protein once export 
of the growing protein chain across the rough endoplasmic reticulum has been initiated. 
Most mammalian cells and even insect cells cleave secreted proteins with the same 
specificity. However, in some cases, cleavage of a secreted protein is not entirely uniform, 
which results in two or more mature species of the protein. Further, it has long been 
known that cleavage specificity of a secreted protein is ultimately determined by the 
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primary structure of the complete protein, that is, it is inherent in the amino acid sequence 
of the polypeptide. 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of McGeoch, 
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged region 
and a subsequent uncharged region of the complete (uncleaved) protein. The method of 
von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the information from the 
residues surrounding the cleavage site, typically residues -13 to +2, where +1 indicates the 
amino terminus of the secreted protein. The accuracy of predicting the cleavage points of 
known mammalian secretory proteins for each of these methods is in the range of 75-80%. 
(von Heinje, supra.) However, the two methods do not always produce the same predicted 
cleavage point(s) for a given protein. 

In the present case, the deduced amino acid sequence of the secreted polypeptide 
was analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein 
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on the 
amino acid sequence. As part of this computational prediction of localization, the methods 
of McGeoch and von Heinje are incorporated. The analysis of the amino acid sequences of 
the secreted proteins described herein by this program provided the results shown in Table 

xin. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes vary 
from organism to organism and cannot be predicted with absolute certainty. Accordingly, 
the present invention provides secreted polypeptides having a sequence shown in SEQ ID 
NO:Y which have an N-terminus beginning within 5 residues (i.e., + or - 5 residues) of the 
predicted cleavage point. Similarly, it is also recognized that in some cases, cleavage of 
the signal sequence from a secreted protein is not entirely uniform, resulting in more than 
one secreted species. These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 
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Moreover, the signal sequence identified by the above analysis may not necessarily 
predict the naturally occurring signal sequence. For example, the naturally occurring 
signal sequence may be further upstream from the predicted signal sequence. However, it 
is likely that the predicted signal sequence will be capable of directing the secreted protein 
to the ER. Nonetheless, the present invention provides the mature protein produced by 
expression of the polynucleotide sequence of SEQ ID NO:X and/or the polynucleotide 
sequence contained in the cDNA of a deposited clone, in a mammalian cell (e.g., COS 
cells, as desribed below). These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 

Polynucleotide and Poly peptide Vflrfo^ 

The present invention is directed to variants of the polynucleotide sequence 
disclosed in SEQ ID NO:X, the complementary strand thereto, and/or the cDNA sequence 
contained in a deposited clone. 

The present invention also encompasses variants of the polypeptide sequence 
disclosed in SEQ ID NO:Y and/or encoded by a deposited clone. 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical to 
the polynucleotide or polypeptide of the present invention. 

The present invention is also directed to nucleic acid molecules which comprise, or 
alternatively consist of, a nucleotide sequence which is at least 80%, 85%, 90%, 95%, 
96%, 97%, 98% or 99% identical to, for example, the nucleotide coding sequence in SEQ 
ID NO:X or the complementary strand thereto, the nucleotide coding sequence contained 
in a deposited cDNA clone or the complementary strand thereto, a nucleotide sequence 
encoding the polypeptide of SEQ ID NO: Y, a nucleotide sequence encoding the 
polypeptide encoded by the cDNA contained in a deposited clone, and/or polynucleotide 
fragments of any of these nucleic acid molecules (e.g., those fragments described herein). 
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Polynucleotides which hybridize to these nucleic acid molecules under stringent 
hybridization conditions or lower stringency conditions are also encompassed by the 
invention, as are polypeptides encoded by these polynucleotides. 

The present invention is also directed to polypeptides which comprise, or 
alternatively consist of, an amino acid sequence which is at least 80%, 85%, 90%, 95%, 
96%, 97%, 98%, 99% identical to, for example, the polypeptide sequence shown in SEQ 
ID NO:Y, the polypeptide sequence encoded by the cDNA contained in a deposited clone, 
and/or polypeptide fragments of any of these polypeptides (e.g., those fragments described 
herein). 

By a nucleic acid having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended that 
the nucleotide sequence of the nucleic acid is identical to the reference sequence except 
that the nucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence encoding the polypeptide. In other 
words, to obtain a nucleic acid having a nucleotide sequence at least 95% identical to a 
reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may 
be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of 
the total nucleotides in the reference sequence may be inserted into the reference sequence. 
The query sequence may be an entire sequence shown in Table XIII, the ORF (open 
reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide 
is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 
computer programs. A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also referred 
to as a global sequence alignment, can be determined using the FASTDB computer 
program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245(1990)). 
In a sequence alignment the query and subject sequences are both DNA sequences. An 
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RNA sequence can be compared by converting U's to T's. The result of said global 
sequence alignment is in percent identity. Preferred parameters used in a FASTDB 
alignment of DNA sequences to calculate percent identiy are: Matrix=Unitary, k-tuple=4, 
Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the 
subject nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3" 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' truncations 
of the subject sequence when calculating percent identity. For subject sequences truncated 
at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by 
calculating the number of bases of the query sequence that are 5' and 3' of the subject 
sequence, which are not matched/aligned, as a percent of the total bases of the query 
sequence. Whether a nucleotide is matched/aligned is determined by results of the 
FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This corrected score is what is used for the 
purposes of the present invention. Only bases outside the 5' and 3' bases of the subject 
sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the 
query sequence, are calculated for the purposes of manually adjusting the percent identity 
score. 

For example, a 90 base subject sequence is aligned to a 100 base query sequence to 
determine percent identity. The deletions occur at the 5' end of the subject sequence and 
therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases 
at 5' end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5' 
and 3' ends not matched/total number of bases in the query sequence) so 10% is subtracted 
from the percent identity score calculated by the FASTDB program. If the remaining 90 
bases were perfectly matched the final percent identity would be 90%. In another 
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example, a 90 base subject sequence is compared with a 100 base query sequence. This 
time the deletions are internal deletions so that there are no bases on the 5' or 3' of the 
subject sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only bases 5' and 
3' of the subject sequence which are not matched/aligned with the query sequence are 
manually corrected for. No other manual corrections are to made for the purposes of the 
present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that the 
amino acid sequence of the subject polypeptide is identical to the query sequence except 
that the subject polypeptide sequence may include up to five amino acid alterations per 
each 100 amino acids of the query amino acid sequence. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a query amino acid 
sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, 
deleted, (indels) or substituted with another amino acid. These alterations of the reference 
sequence may occur at the amino or carboxy terminal positions of the reference amino acid 
sequence or anywhere between those terminal positions, interspersed either individually 
among residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

As a practical matter, whether any particular polypeptide is at least 80%, 85%, 
90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, an amino acid sequences 
shown in Table XIII (SEQ ID NO: Y) or to the amino acid sequence encoded by cDNA 
contained in a deposited clone can be determined conventionally using known computer 
programs. A preferred method for determing the best overall match between a query 
sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program 
based on the algorithm of Brutlag et a!. (Comp. App. Biosci. 6:237-245(1990)). In a 
sequence alignment the query and subject sequences are either both nucleotide sequences 
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or both amino acid sequences. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB amino acid alignment are: 
Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization 
Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap 
5 Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for N- and C-terminal 

10 truncations of the subject sequence when calculating global percent identity. For subject 
sequences truncated at the N- and C-termini, relative to the query sequence, the percent 
identity is corrected by calculating the number of residues of the query sequence that are 
N- and C-terminal of the subject sequence, which are not matched/aligned with a 
corresponding subject residue, as a percent of the total bases of the query sequence. 

1 5 Whether a residue is matched/aligned is determined by results of the FASTDB sequence 
alignment. This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final percent 
identity score. This final percent identity score is what is used for the purposes of the 
present invention. Only residues to the N- and C-termini of the subject sequence, which 

20 are not matched/aligned with the query sequence, are considered for the purposes of 

manually adjusting the percent identity score. That is, only query residue positions outside 
the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 

25 terminus of the subject sequence and therefore, the FASTDB alignment does not show a 
matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues 
represent 10% of the sequence (number of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from the 
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percent identity score calculated by the FASTDB program. If the remaining 90 residues 
were perfectly matched the final percent identity would be 90%. In another example, a 90 
residue subject sequence is compared with a 100 residue query sequence. This time the 
deletions are internal deletions so there are no residues at the N- or C-termini of the subject 
sequence which are not matched/aligned with the query. In this case the percent identity 
calculated by FASTDB is not manually corrected. Once again, only residue positions 
outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB 
alignment, which are not matched/aligned with the query sequnce are manually corrected 
for. No other manual corrections are to made for the purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding regions, or 
both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 
activities of the encoded polypeptide. Nucleotide variants produced by silent substitutions 
due to the degeneracy of the genetic code are preferred. Moreover, variants in which 5-10, 
1-5, or 1-2 amino acids are substituted, deleted, or added in any combination are also 
preferred. Polynucleotide variants can be produced for a variety of reasons, e.g., to 
optimize codon expression for a particular host (change codons in the human mRNA to 
those preferred by a bacterial host such as E. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of several 
alternate forms of a gene occupying a given locus on a chromosome of an organism. 
(Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These allelic variants 
can vary at either the polynucleotide and/or polypeptide level and are included in the 
present invention. Alternatively, non-naturally occurring variants may be produced by 
mutagenesis techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, 
variants may be generated to improve or alter the characteristics of the polypeptides of the 
present invention. For instance, one or more amino acids can be deleted from the N- 
terminus or C-terminus of the secreted protein without substantial loss of biological 
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function. The authors of Ron et a]., J. Biol. Chem. 268: 2984-2988 (1993), reported 
variant KGF proteins having heparin binding activity even after deleting 3, 8, or 27 amino- 
terminal amino acid residues. Similarly, Interferon gamma exhibited up to ten times 
higher activity after deleting 8-10 amino acid residues from the carboxy terminus of this 
5 protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers (J. Biol. Chem 268:22105-22111 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 3,500 
10 individual IL-la mutants that averaged 2.5 amino acid changes per variant over the entire 
length of the molecule. Multiple mutations were examined at every possible amino acid 
position. The investigators found that "[m]ost of the molecule could be altered with little 
effect on either [binding or biological activity]." (See, Abstract.) In fact, only 23 unique 
amino acid sequences, out of more than 3.500 nucleotide sequences examined, produced a 
15 protein that significantly differed in activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or C- 
terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form will 
20 likely be retained when less than the majority of the residues of the secreted form are 

removed from the N-terminus or C-terminus. Whether a particular polypeptide lacking N- 
or C-terminal residues of a protein retains such immunogenic activities can readily be 
determined by routine methods described herein and otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show substantial 
25 biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general rules known in the art so as have little effect on 
activity. For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie et al., Science 247:1306-1310 (1990), wherein the 
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authors indicate that there are two main strategies for studying the tolerance of an amino 
acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in different 
species, conserved amino acids can be identified. These conserved amino acids are likely 
important for protein function. In contrast, the amino acid positions where substitutions 
have been tolerated by natural selection indicates that these positions are not critical for 
protein function. Thus, positions tolerating amino acid substitution could be modified 
while still maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of single 
alanine mutations at every residue in the molecule) can be used. (Cunningham and Wells, 
Science 244:1081-1085 (1989).) The resulting mutant molecules can then be tested for 
biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which amino 
acid changes are likely to be permissive at certain amino acid positions in the protein. For 
example, most buried (within the tertiary structure of the protein) amino acid residues 
require nonpolar side chains, whereas few features of surface side chains are generally 
conserved. Moreover, tolerated conservative amino acid substitutions involve replacement 
of the aliphatic or hydrophobic amino acids Ala, Val, Leu and He; replacement of the 
hydroxyl residues Ser and Thr; replacement of the acidic residues Asp and Glu; 
replacement of the amide residues Asn and Gin, replacement of the basic residues Lys, 
Arg, and His; replacement of the aromatic residues Phe, Tyr, and Tip, and replacement of 
the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, where 
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the substituted amino acid residues may or may not be one encoded by the genetic code, or 
(ii) substitution with one or more of amino acid residues having a substituent group, or (iii) 
fusion of the mature polypeptide with another compound, such as a compound to increase 
the stability and/or solubility of the polypeptide (for example, polyethylene glycol), or (iv) 
5 fusion of the polypeptide with additional amino acids, such as, for example, an IgG Fc 
fusion region peptide, or leader or secretory sequence, or a sequence facilitating 
purification. Such variant polypeptides are deemed to be within the scope of those skilled 
in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of charged 
10 amino acids with other charged or neutral amino acids may produce proteins with 
improved characteristics, such as less aggregation. Aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's 
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:33 1-340 (1967); Robbins et 
al., Diabetes 36: 838-845 (1987); Cleland et al.. Crit. Rev. Therapeutic Drug Carrier 
15 Systems 10:307-377(1993).) 

A further embodiment of the invention relates to a polypeptide which comprises the 
amino acid sequence of the present invention having an amino acid sequence which 
contains at least one amino acid substitution, but not more than 50 amino acid 
substitutions, even more preferably, not more than 40 amino acid substitutions, still more 
20 preferably, not more than 30 amino acid substitutions, and still even more preferably, not 
more than 20 amino acid substitutions. Of course, in order of ever-increasing preference, it 
is highly preferable for a peptide or polypeptide to have an amino acid sequence which 
comprises the amino acid sequence of the present invention, which contains at least one, 
but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 amino acid substitutions. In specific 
25 embodiments, the number of additions, substitutions, and/or deletions in the amino acid 
sequence of the present invention or fragments thereof (e.g., the mature form and/or other 
fragments described herein), is 1-5, 5-10, 5-25, 5-50, 10-50 or 50-150, conservative amino 
acid substitutions are preferable. 
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Polynucleotide and Polypeptide Fra^^tc 

The present invention is also directed to polynucleotide fragments of the 
polynucleotides of the invention. 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence which: is a portion of that contained in a 
deposited clone, or encoding the polypeptide encoded by the cDNA in a deposited clone; is 
a portion of that shown in SEQ ID NO:X or the complementary strand thereto, or is a 
portion of a polynucleotide sequence encoding the polypeptide of SEQ ID NO:Y. The 
nucleotide fragments of the invention are preferably at least about 15 nt, and more 
preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably, at least about 40 nt, at least about 50 nt, at least about 75 nt, or at least about 
150 nt in length. A fragment "at least 20 nt in length," for example, is intended to include 
20 or more contiguous bases from the cDNA sequence contained in a deposited clone or 
the nucleotide sequence shown in SEQ ID NO:X. In this context "about" includes the 
particularly recited value, a value larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, 
at either terminus or at both termini. These nucleotide fragments have uses that include, 
but are not limited to, as diagnostic probes and primers as discussed herein. Of course, 
larger fragments (e.g., 50, 150, 500, 600, 2000 nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the invention, 
include, for example, fragments comprising, or alternatively consisting of, a sequence from 
about nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 
351-400, 401-450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851- 
900,901-950,951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 
1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951-2000, 
or 2001 to the end of SEQ ID NO:X, or the complementary strand thereto, or the cDNA 
contained in a deposited clone. In this context "about" includes the particularly recited 
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ranges, and ranges larger or smaller by several (5, 4, 3, 2, or ]) nucleotides, at either 
terminus or at both termini. Preferably, these fragments encode a polypeptide which has 
biological activity. More preferably, these polynucleotides can be used as probes or 
primers as discussed herein. Polynucleotides which hybridize to these nucleic acid 
5 molecules under stringent hybridization conditions or lower stringency conditions are also 
encompassed by the invention, as are polypeptides encoded by these polynucleotides. 

In the present invention, a "polypeptide fragment" refers to an amino acid sequence 
which is a portion of that contained in SEQ ID NO: Y or encoded by the cDNA contained 
in a deposited clone. Protein (polypeptide) fragments may be "free-standing," or 

10 comprised within a larger polypeptide of which the fragment forms a part or region, most 
preferably as a single continuous region. Representative examples of polypeptide 
fragments of the invention, include, for example, fragments comprising, or alternatively 
consisting of, from about amino acid number 1-20. 21-40, 41-60, 61-80, 81-100, 102-120, 
121-140, 141-160, or 161 to the end of the coding region. Moreover, polypeptide 

15 fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 1 10, 120, 130, 140, or 150 
amino acids in length. In this context "about" includes the particularly recited ranges or 
values, and ranges or values larger or smaller by several (5, 4, 3, 2, or 1) amino acids, at 
either extreme or at both extremes. Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

20 Preferred polypeptide fragments include the secreted protein as well as the mature 

form. Further preferred polypeptide fragments include the secreted protein or the mature 
form having a continuous series of deleted residues from the amino or the carboxy 
terminus, or both. For example, any number of amino acids, ranging from 1-60, can be 
deleted from the amino terminus of either the secreted polypeptide or the mature form. 

25 Similarly, any number of amino acids, ranging from 1-30, can be deleted from the carboxy 
terminus of the secreted protein or mature form. Furthermore, any combination of the 
above amino and carboxy terminus deletions are preferred. Similarly, polynucleotides 
encoding these polypeptide fragments are also preferred. 
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Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and alpha- 
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming 
regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha 
amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, 
substrate binding region, and high antigenic index regions. Polypeptide fragments of SEQ 
ID NO: Y falling within conserved domains are specifically contemplated by the present 
invention. Moreover, polynucleotides encoding these domains are also contemplated. 

Other preferred polypeptide fragments are biologically active fragments. 
Biologically active fragments are those exhibiting activity similar, but not necessarily 
identical, to an activity of the polypeptide of the present invention. The biological activity 
of the fragments may include an improved desired activity, or a decreased undesirable 
activity. Polynucleotides encoding these polypeptide fragments are also encompassed by 
the invention. 

Preferably, the polynucleotide fragments of the invention encode a polypeptide 
which demonstrates a functional activity. By a polypeptide demonstrating a "functional 
activity" is meant, a polypeptide capable of displaying one or more known functional 
activities associated with a full-length (complete) polypeptide of invention protein. Such 
functional activities include, but are not limited to, biological activity, antigenicity [ability 
to bind (or compete with a polypeptide of the invention for binding) to an antibody to the 
polypeptide of the invention], immunogenics ty (ability to generate antibody which binds to 
a polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide of the invention. 

The functional activity of polypeptides of the invention, and fragments, variants 
derivatives, and analogs thereof, can be assayed by various methods. 

For example, in one embodiment where one is assaying for the ability to bind or 
compete with full-length polypeptide of the invention for binding to an antibody of the 
polypeptide of the invention, various immunoassays known in the art can be used, 
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including but not limited to, competitive and non-competitive assay systems using 
techniques such as radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitation 
reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or 
radioisotope labels, for example), western blots, precipitation reactions, agglutination 
assays (e.g., gel agglutination assays, hemagglutination assays), complement fixation 
assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, 
etc. In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting binding of 
a secondary antibody or reagent to the primary antibody. In a further embodiment, the 
secondary antibody is labeled. Many means are known in the art for detecting binding in 
an immunoassay and are within the scope of the present invention. 

In another embodiment, where a ligand for a polypeptide of the invention 
identified, or the ability of a polypeptide fragment, variant or derivative of the invention to 
multimerize is being evaluated, binding can be assayed, e.g., by means well-known in the 
art, such as, for example, reducing and non-reducing gel chromatography, protein affinity 
chromatography, and affinity blotting. See generally, Phizicky, R, et al., 1995, Microbiol. 
Rev. 59:94-123. In another embodiment, physiological correlates of binding of a 
polypeptide of the invention to its substrates (signal transduction) can be assayed. 

In addition, assays described herein (see Examples) and otherwise known in the art 
may routinely be applied to measure the ability of polypeptides of the invention and 
fragments, variants derivatives and analogs thereof to elicit related biological activity 
related to that of the polypeptide of the invention (either in vitro or in vivo). Other 
methods will be known to the skilled artisan and are within the scope of the invention. 

Epitopes ft ^tjhn^ 

The present invention is also directed to polypeptide fragments comprising, or 
alternatively consisting of, an epitope of the polypeptide sequence shown in SEQ ID 
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NO: Y, or the polypeptide sequence encoded by the cDNA contained in a deposited clone. 
Polynucleotides encoding these epitopes (such as, for example, the sequence disclosed in 
SEQ ID NO:X) are also encompassed by the invention, as is the nucleotide sequences of 
the complementary strand of the polynucleotides encoding these epitopes. And 
polynucleotides which hybridize to the complementary strand under stringent hybridization 
conditions or lower stringency conditions. 

In the present invention, "epitopes" refer to polypeptide fragments having antigenic 
or immunogenic activity in an animal, especially in a human. A preferred embodiment of 
the present invention relates to a polypeptide fragment comprising an epitope, as well as 
the polynucleotide encoding this fragment. A region of a protein molecule to which an 
antibody can bind is defined as an "antigenic epitope." In contrast, an "immunogenic 
epitope" is defined as a part of a protein that elicits an antibody response. (See, for 
instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998- 4002 (1983).) 

Fragments which function as epitopes may be produced by any conventional 
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) 
further described in U.S. Patent No. 4,63 1 ,21 1 .) 

In the present invention, antigenic epitopes preferably contain a sequence of at least 
4, at least 5. at least 6, at least 7, more preferably at least 8, at least 9. at least 10, at least 
15, at least 20, at least 25, and most preferably between about 15 to about 30 amino acids. 
Preferred polypeptides comprising immunogenic or antigenic epitopes are at least 10, 15, 
20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 amino acid residues in 
length. Antigenic epitopes are useful, for example, to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et al.. 
Cell 37:767-778 (1984); Sutcliffe et al., Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used, for example, to induce antibodies 
according to methods well known in the art. (See, for instance, Sutcliffe et al., supra; 
Wilson et al.. supra; Chow et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle et al., 
J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes the 
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secreted protein. The immunogenic epitopes may be presented together with a carrier 
protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if it is long 
enough (at least about 25 amino acids), without a carrier. However, immunogenic epitopes 
comprising as few as 8 to 10 amino acids have been shown to be sufficient to raise 
5 antibodies capable of binding to, at the very least, linear epitopes in a denatured 
polypeptide (e.g., in Western blotting.) 

Epitope-bearing polypeptides of the present invention may be used to induce 
antibodies according to methods well known in the art including, but not limited to, in vivo 
immunization, in vitro immunization, and phage display methods. See, e.g., Sutcliffe et 

10 al., supra; Wilson et al., supra, and Bittle et a!., J. Gen. Virol., 66:2347-2354 (1985). If in 
vivo immunization is used, animals may be immunized with free peptide; however, 
anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular 
carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine residues may be coupled to a carrier using a linker such as 

15 -maleimidobenzoyl- N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carriers using a more general linking agent such as glutaraldehyde. Animals 
such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, 
for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 
100 /xgs of peptide or carrier protein and Freund's adjuvant. Several booster injections 

20 may be needed, for instance, at intervals of about two weeks, to provide a useful titer of 
anti-peptide antibody which can be detected, for example, by ELISA assay using free 
peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an 
immunized animal may be increased by selection of anti-peptide antibodies, for instance, 
by adsorption to the peptide on a solid support and elution of the selected antibodies 

25 according to methods well known in the art. 

As one of skill in the art will appreciate, and discussed above, the polypeptides of 
the present invention comprising an immunogenic or antigenic epitope can be fused to 
heterologous polypeptide sequences. For example, the polypeptides of the present 
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invention may be fused with the constant domain of immunoglobulins (IgA, IgE, IgG, 
IgM), or portions thereof (CHI, CH2, CH3, any combination thereof including both entire 
domains and portions thereof) resulting in chimeric polypeptides. These fusion proteins 
facilitate purification, and show an increased half-life in vivo. This has been shown, e.g., 
for chimeric proteins consisting of the first two domains of the human CD4-polypeptide 
and various domains of the constant regions of the heavy or light chains of mammalian 
immunoglobulins. See, e.g., EPA 0,394,827; Traunecker et al., Nature, 331:84-86 (1988). 
Fusion proteins that have a disulfide-linked dimeric structure due to the IgG portion can 
also be more efficient in binding and neutralizing other molecules than monomeric 
polypeptides or fragments thereof alone. See, e.g., Fountoulakis et al., J. Biochem., 
270:3958-3964 (1995). Nucleic acids encoding the above epitopes can also be recombined 
with a gene of interest as an epitope tag to aid in detection and purification of the 
expressed polypeptide. 

Additional fusion proteins of the invention may be generated through the 
techniques of gene-shuffling, motif-shuffling, exon -shuffling, and/or codon-shuffling 
(collectively referred to as "DNA shuffling"). DNA shuffling may be employed to 
modulate the activities of polypeptides corresponding to SEQ ID NO:Y thereby effectively 
generating agonists and antagonists of the polypeptides. See.generally, U.S. Patent Nos. 
5,605,793, 5,81 1,238, 5,830,721, 5,834,252, and 5,837,458, and Patten, P.A., et al., Curr. 
Opinion Biotechnol. 8:724-33 (1997); Harayama, S., Trends Biotechnol. 16(2):76-82 
(1998); Hansson, L.O., et al., J. Mol. Biol. 287:265-76 (1999); and Lorenzo, M. M. and 
Blasco, R., Biotechniques 24(2):308-13 (1998) (each of these patents and publications are 
hereby incorporated by reference). In one embodiment, alteration of polynucleotides 
corresponding to SEQ ID NO:X and corresponding polypeptides may be achieved by DNA 
shuffling. DNA shuffling involves the assembly of two or more DNA segments into a 
desired molecule corresponding to SEQ ID NO:X polynucleotides of the invention by 
homologous, or site-specific, recombination. In another embodiment, polynucleotides 
corresponding to SEQ ID NO:X and corresponding polypeptides may be altered by being 
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subjected to random mutagenesis by error-prone PCR, random nucleotide insertion or 
other methods prior to recombination. In another embodiment, one or more components, 
motifs, sections, parts, domains, fragments, etc., of coding polynucleotide corresponding to 
SEQ ID NO:X, or the polypeptide encoded thereby may be recombined with one or more 
components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
molecules. 



Antibodies 

The present invention further relates to antibodies and T-cell antigen receptors 
(TCR) which specifically bind the polypeptides of the present invention. The antibodies of 
the present invention include IgG (including IgGl, IgG2, IgG3, and IgG4), IgA (including 
IgAl and IgA2), IgD, IgE, or IgM, and IgY. As used herein, the term "antibody" (Ab) is 
meant to include whole antibodies, including single-chain whole antibodies, and antigen- 
binding fragments thereof. Most preferably the antibodies are human antigen binding 
antibody fragments of the present invention and include, but are not limited to, Fab, Fab' 
and F(ab')2, Fd, single-chain Fvs (scFv), single-chain antibodies, disulfide-linked Fvs 
(sdFv) and fragments comprising either a V L or V H domain. The antibodies may be from 
any animal origin including birds and mammals. Preferably, the antibodies are human, 
murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may 
comprise the variable region(s) alone or in combination with the entire or partial of the 
following: hinge region, CHI, CH2, and CH3 domains. Also included in the invention are 
any combinations of variable region(s) and hinge region, CHI, CH2, and CH3 domains. 
The present invention further includes monoclonal, polyclonal, chimeric, humanized, and 
human monoclonal and human polyclonal antibodies which specifically bind the 
polypeptides of the present invention. The present invention further includes antibodies 
which are anti-idiotypic to the antibodies of the present invention. 

The antibodies of the present invention may be monospecific, bispecific, trispecific 
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or of greater multispecificity. Multispecific antibodies may be specific for different 
epitopes of a polypeptide of the present invention or may be specific for both a polypeptide 
of the present invention as well as for heterologous compositions, such as a heterologous 
polypeptide or solid support material. See, e.g., WO 93/17715; WO 92/08802; WO 
91/00360; WO 92/05793; Tutt, et al., J. Immunol. 147:60-69 (1991); US Patents 
5,573,920, 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny et al., J. Immunol. 
148:1547-1553(1992). 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or portion(s) of a polypeptide of the present invention which are recognized or 
specifically bound by the antibody. The epitope(s) or polypeptide portion(s) may be 
specified as described herein, e.g., by N-terminal and C-terminal positions, by size in 
contiguous amino acid residues, or listed in the Tables and Figures. Antibodies which 
specifically bind any epitope or polypeptide of the present invention may also be excluded. 
Therefore, the present invention includes antibodies that specifically bind polypeptides of 
the present invention, and allows for the exclusion of the same. 

Antibodies of the present invention may also be described or specified in terms of 
their cross-reactivity. Antibodies that do not bind any other analog, ortholog, or homolog 
of the polypeptides of the present invention are included. Antibodies that do not bind 
polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less than 
75%, less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% 
identity (as calculated using methods known in the art and described herein) to a 
polypeptide of the present invention are also included in the present invention. Further 
included in the present invention are antibodies which only bind polypeptides encoded by 
polynucleotides which hybridize to a polynucleotide of the present invention under 
stringent hybridization conditions (as described herein). Antibodies of the present 
invention may also be described or specified in terms of their binding affinity. Preferred 
binding affinities include those with a dissociation constant or Kd less than 5X10"*M, 10 
6 M, 5X10- 7 M, lO-'M, 5X10*M, 10 »M, 5X10" 9 M, 10'M, 5X10 l0 M, 10 '°M, 5X10 "M, 10 
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"M, 5X10- ,2 M, 10 ,2 M, 5X10- ,} M, 10 »M, 5X10 ,4 M, 10 ,4 M, 5X10 ,5 M, and 10 ,S M. 

Antibodies of the present invention have uses that include, but are not limited to, 
methods known in the art to purify, detect, and target the polypeptides of the present 
invention including both in vitro and in vivo diagnostic and therapeutic methods. For 
5 example, the antibodies have use in immunoassays for qualitatively and quantitatively 
measuring levels of the polypeptides of the present invention in biological samples. See, 
e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988) (incorporated by reference in the entirety). 

The antibodies of the present invention may be used either alone or in combination 

10 with other compositions. The antibodies may further be recombinantly fused to a 
heterologous polypeptide at the N- or C-terminus or chemically conjugated (including 
covalently and non-covalently conjugations) to polypeptides or other compositions. For 
example, antibodies of the present invention may be recombinantly fused or conjugated to 
molecules useful as labels in detection assays and effector molecules such as heterologous 

15 polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/12624; US 
Patent 5,314,995; and EP 0 396 387. 

The antibodies of the present invention may be prepared by any suitable method known in 
the art. For example, a polypeptide of the present invention or an antigenic fragment 
thereof can be administered to an animal in order to induce the production of sera 

20 containing polyclonal antibodies. The term "monoclonal antibody" is not a limited to 
antibodies produced through hybridoma technology. The term "monoclonal antibody" 
refers to an antibody that is derived from a single clone, including any eukaryotic, 
prokaryotic, or phage clone, and not the method by which it is produced. Monoclonal 
antibodies can be prepared using a wide variety of techniques known in the art including 

25 the use of hybridoma, recombinant, and phage display technology. 

Hybridoma techniques include those known in the art and taught in Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nd ed. 1988); Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL 
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HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981) (said references incorporated by 
reference in their entireties). Fab and F(ab')2 fragments may be produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce 
F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the 
application of recombinant DNA and phage display technology or through synthetic 
chemistry using methods known in the art. For example, the antibodies of the present 
invention can be prepared using various phage display methods known in the art. In phage 
display methods, functional antibody domains are displayed on the surface of a phage 
particle which carries polynucleotide sequences encoding them. Phage with a desired 
binding property are selected from a repertoire or combinatorial antibody library (e.g. 
human or murine) by selecting directly with antigen, typically antigen bound or captured to 
a solid surface or bead. Phage used in these methods are typically filamentous phage 
including fd and M13 with Fab, Fv or disulfide stabilized Fv antibody domains 
recombinantly fused to either the phage gene III or gene VIII protein. Examples of phage 
display methods that can be used to make the antibodies of the present invention include 
those disclosed in Brinkman et al., J. Immunol. Methods 182:41-50 (1995); Ames et al., J. 
Immunol. Methods 184: 177-186 (1995); Kettleborough et al., Eur. J. Immunol. 24:952-958 
(1994); Persic et al.. Gene 187 9-18 (1997); Burton et al., Advances in Immunology 
57:191-280 (1994); PCT/GB91/01134; WO 90/02809; WO 91/10737; WO 92/01047; WO 
92/18619; WO 93/1 1236; WO 95/15982; WO 95/20401; and US Patents 5,698,426, 
5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 5,821,047, 5,571,698, 5,427,908, 
5,516,637, 5,780,225. 5,658,727 and 5,733,743 (said references incorporated by reference 
in their entireties). 

As described in the above references, after phage selection, the antibody coding 
regions from the phage can be isolated and used to generate whole antibodies, including 
human antibodies, or any other desired antigen binding fragment, and expressed in any 
desired host including mammalian cells, insect cells, plant cells, yeast, and bacteria. For 
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example, techniques to recombinantly produce Fab, Fab' and F(ab')2 fragments can also be 
employed using methods known in the art such as those disclosed in WO 92/22324; 
Mullinax et al., BioTechniques 12(6):864-869 (1992); and Sawai et al., AJRI 34:26-34 
(1995); and Better et al., Science 240:1041-1043 (1988) (said references incorporated by 
reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and 
antibodies include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al., 
Methods in Enzymology 203:46-88 (1991); Shu, L. et al., PNAS 90:7995-7999 (1993); 
and Skerra et al., Science 240:1038-1040 (1988). For some uses, including in vivo use of 
antibodies in humans and in vitro detection assays, it may be preferable to use chimeric, 
humanized, or human antibodies. Methods for producing chimeric antibodies are known 
in the art. See e.g., Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Gillies et al., (1989) J. Immunol. Methods 125:191-202; and US Patent 5,807,715. 
Antibodies can be humanized using a variety of techniques including CDR-grafting (EP 0 
239 400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or resurfacing 
(EP 0 592 106; EP 0 519 596; Padlan E.A., Molecular Immunology 28(4/5):489-498 
(1991); Studnicka et al., Protein Engineering 7(6):805-814 (1994); Roguska. et al., PNAS 
91:969-973 (1994)), and chain shuffling (US Patent 5,565,332). Human antibodies can be 
made by a variety of methods known in the art including phage display methods described 
above. See also, US Patents 4,444,887, 4,716,1 1 1, 5,545,806, and 5,814,318; and WO 
98/46645, WO 98/50433, WO 98/24893, WO 98/16654, WO 96/34096, WO 96/33735, 
and WO 91/10741 (said references incorporated by reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or 
chemically conjugated (including both covalently and non-covalently conjugations) to a 
polypeptide of the present invention. The antibodies may be specific for antigens other 
than polypeptides of the present invention. For example, antibodies may be used to target 
the polypeptides of the present invention to particular cell types, either in vitro or in vivo, 
by fusing or conjugating the polypeptides of the present invention to antibodies specific for 
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particular cell surface receptors. Antibodies fused or conjugated to the polypeptides of the 
present invention may also be used in in vitro immunoassays and purification methods 
using methods known in the art. See e.g., Harbor et al. supra and WO 93/21232; EP 0 439 
095; Naramura et al., Immunol. Lett. 39:91-99 (1994); US Patent 5,474,981; Gillies et al., 
PNAS 89:1428-1432 (1992); Fell et al., J. Immunol. 146:2446-2452 (1991) (said 
references incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of 
the present invention fused or conjugated to antibody domains other than the variable 
regions. For example, the polypeptides of the present invention may be fused or 
conjugated to an antibody Fc region, or portion thereof. The antibody portion fused to a 
polypeptide of the present invention may comprise the hinge region, CHI domain, CH2 
domain, and CH3 domain or any combination of whole domains or portions thereof. The 
polypeptides of the present invention may be fused or conjugated to the above antibody 
portions to increase the in vivo half life of the polypeptides or for use in immunoassays 
using methods known in the art. The polypeptides may also be fused or conjugated to the 
above antibody portions to form multimers. For example, Fc portions fused to the 
polypeptides of the present invention can form dimers through disulfide bonding between 
the Fc portions. Higher multimeric forms can be made by fusing the polypeptides to 
portions of IgA and IgM. Methods for fusing or conjugating the polypeptides of the present 
invention to antibody portions are known in the art. See e.g., US Patents 5,336,603, 
5,622,929, 5,359,046, 5,349,053, 5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 
96/04388, WO 91/06570; Ashkenazi et al., PNAS 88:10535-10539 (1991); Zheng et al., J. 
Immunol. 154:5590-5600 (1995); and Vil et al., PNAS 89:1 1337-1 1341 (1992) (said 
references incorporated by reference in their entireties). 

The invention further relates to antibodies which act as agonists or antagonists of 
the polypeptides of the present invention. For example, the present invention includes 
antibodies which disrupt the receptor/I igand interactions with the polypeptides of the 
invention either partially or fully. Included are both receptor-specific antibodies and 
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ligand-specific antibodies. Included are receptor-specific antibodies which do not prevent 
ligand binding but prevent receptor activation. Receptor activation (i.e., signaling) may be 
determined by techniques described herein or otherwise known in the art. Also included 
are receptor-specific antibodies which both prevent ligand binding and receptor activation. 
Likewise, included are neutralizing antibodies which bind the ligand and prevent binding 
of the ligand to the receptor, as well as antibodies which bind the ligand, thereby 
preventing receptor activation, but do not prevent the ligand from binding the receptor. 
Further included are antibodies which activate the receptor. These antibodies may act as 
agonists for either all or less than all of the biological activities affected by ligand- 
mediated receptor activation. The antibodies may be specified as agonists or antagonists 
for biological activities comprising specific activities disclosed herein. The above 
antibody agonists can be made using methods known in the art. See e.g., WO 96/40281; 
US Patent 5,81 1,097; Deng et aL, Blood 92(6): 1981-1988 (1998); Chen, et al., Cancer Res. 
58(16):3668-3678 (1998); Harrop et al., J. Immunol. 161(4): 1786-1794 (1998); Zhu et al., 
Cancer Res. 58(15):3209-3214 (1998); Yoon, et aL, J. Immunol. 160(7):3170-3179 (1998); 
Prat et al., J. Cell. Sci. lll(Pt2):237-247 (1998); Pitard et al., J. Immunol. Methods 
205(2):177-190 (1997); Liautard et aL, Cytokinde 9(4):233-241 (1997); Carlson et al., J. 
Biol. Chem. 272(17): 11295-1 1301 (1997); Taryman et aL, Neuron 14(4):755-762 (1995); 
Mulleret a!., Structure 6(9): 1153-1 167 (1998); Bartunek et aL, Cytokine 8(l):14-20 (1996) 
(said references incorporated by reference in their entireties). 

As discussed above, antibodies to the polypeptides of the invention can, in turn, be 
utilized to generate anti-idiotype antibodies that "mimic" polypeptides of the invention 
using techniques well known to those skilled in the art. (See, e.g., Greenspan & Bona, 
FASEB J. 7(5):437-444; (1989) and Nissinoff, J. Immunol. 147(8):2429-2438 (1991)). 
For example, antibodies which bind to and competitively inhibit polypeptide 
multimerization and/or binding of a polypeptide of the invention to ligand can be used to 
generate anti-idiotypes that "mimic" the polypeptide mutimerization and/or binding 
domain and, as a consequence, bind to and neutralize polypeptide and/or its ligand. Such 
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neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in 
therapeutic regimens to neutralize polypeptide ligand. For example, such anti-idiotypic 
antibodies can be used to bind a polypeptide of the invention and/or to bind its 
ligands/receptors, and thereby block its biological activity. 

The invention further relates to a diagnostic kit for use in screening serum 
containing antibodies specific against proliferative and/or cancerous polynucleotides and 
polypeptides. Such a kit may include a substantially isolated polypeptide antigen 
comprising an epitope which is specifically immunoreactive with at least one anti- 
polypeptide antigen antibody. Such a kit also includes means for detecting the binding of 
said antibody to the antigen. In specific embodiments, the kit may include a recombinantly 
produced or chemically synthesized polypeptide antigen. The polypeptide antigen of the 
kit may also be attached to a solid support. 

In a more specific embodiment the detecting means of the above-described kit 
includes a solid support to which said polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labelled anti-human antibody. In this embodiment, binding 
of the antibody to the polypeptide antigen can be detected by binding of the said reporter- 
labelled antibody. 

The invention further includes a method of detecting proliferative and/or cancerous 
disorders and conditions in a test subject. This detection method includes reacting serum 
from a test subject (e.g. one in which proliferative and/or cancerous cells or tissues may be 
present) with a substantially isolated polypeptide and/or polynucleotide antigen, and 
examining the antigen for the presence of bound antibody. In a specific embodiment, the 
method includes a polypeptide antigen attached to a solid support, and the serum is reacted 
with the support. Subsequently, the support is reacted with a reporter labelled anti-human 
antibody. The solid support is then examined for the presence of reporter-labelled 
antibody. 

Additionally, the invention includes a proliferative condition vaccine composition. 
The composition includes a substantially isolated polypeptide and/or polynucleotide 
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antigen, where the antigen includes an epitope which is specifically immunoreactive with 
at least antibody specific for the epitope. The peptide and/or polynucleotide antigen may 
be produced according to methods known in the art, including recombinant expression or 
chemical synthesis. The peptide antigen is preferably present in a pharmacologically 
5 effective dose in a pharmaceutical ly acceptable carrier. 

Further, the invention includes a monoclonal antibody that is specifically 
immunoreactive with polypeptide and/or polynucleotide epitopes. The invention includes a 
substantially isolated preparation of polyclonal antibodies specifically immunoreactive 
with polynucleotides and/or polypeptides of the present invention. In a more specific 

10 embodiment, such polyclonal antibodies are prepared by affinity chromatography, in 
addition to, other methods known in the art. 

In another emodiment, the invention includes a method for producing antibodies to 
polypeptide and/or polynucleotide antigens. The method includes administering to a test 
subject a substantially isolated polypeptide and/or polynucleotide antigen, where the 

15 antigen includes an epitope which is specifically immunoreactive with at least one anti- 
polypeptide and/or polynucleotide antibody. The antigen is administered in an amount 
sufficient to produce an immune response in the subject. 

In an additional embodiment, the invention includes a diagnostic kit for use in 
screening serum containing antigens of the polypeptide of the invention. The diagnostic kit 

20 includes a substantially isolated antibody specifically immunoreactive with polypeptide or 
polynucleotide antigens, and means for detecting the binding of the polynucleotide or 
polypeptide antigen to the antibody. In one embodiment, the antibody is attached to a solid 
support. In a specific embodiment, the antibody may be a monoclonal antibody. The 
detecting means of the kit may include a second, labelled monoclonal antibody. 

25 Alternatively, or in addition, the detecting means may include a labelled, competing 
antigen. 

In one diagnostic configuration, test serum is reacted with a solid phase reagent 
having a surface-bound antigen obtained by the methods of the present invention. After 
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binding with specific antigen antibody to the reagent and removing unbound serum 
components by washing, the reagent is reacted with reporter-labelled anti-human antibody 
to bind reporter to the reagent in proportion to the amount of bound anti-antigen antibody 
on the solid support. The reagent is again washed to remove unbound labelled antibody, 
and the amount of reporter associated with the reagent is determined. Typically, the 
reporter is an enzyme which is detected by incubating the solid phase in the presence of a 
suitable fluorometric or colorimetric substrate (Sigma, St. Louis, MO). 

The solid surface reagent in the above assay is prepared by known techniques for 
attaching protein material to solid support material, such as polymeric beads, dip sticks, 
96-well plate or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein, typically 
through a free amine group, to a chemically reactive group on the solid support, such as an 
activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates 
can be used in conjunction with biotinylated antigen(s). 

Thus, the invention provieds an assay system or kit for carrying out this diagnostic 
method. The kit generally includes a support with surface-bound recombinant antigens, 
and a reporter-labelled anti-human antibody for detecting surface-bound anti-antigen 
antibody. 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion proteins. 
For example, the polypeptide of the present invention, when fused to a second protein, can 
be used as an antigenic tag. Antibodies raised against the polypeptide of the present 
invention can be used to indirectly detect the second protein by binding to the polypeptide. 
Moreover, because secreted proteins target cellular locations based on trafficking signals, 
the polypeptides of the present invention can be used as targeting molecules once fused to 
other proteins. 
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Examples of domains that can be fused to polypeptides of the present invention 
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through linker 
sequences. 

5 Moreover, fusion proteins may also be engineered to improve characteristics of the 

polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to 
improve stability and persistence during purification from the host cell or subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate 

10 purification. Such regions may be removed prior to final preparation of the polypeptide. 
The addition of peptide moieties to facilitate handling of polypeptides are familiar and 
routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 

15 immunoglobulins (IgA, IgE, IgG, IgM) or portions thereof (CHI, CH2, CH3, and any 
combination thereof, including both entire domains and portions thereof), resulting in 
chimeric polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. One reported example describes chimeric proteins consisting of the first 
two domains of the human CD4-polypeptide and various domains of the constant regions 

20 of the heavy or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker 
et al., Nature 331:84-86 (1988).) Fusion proteins having disulfide-linked dimeric 
structures (due to the IgG) can also be more efficient in binding and neutralizing other 
molecules, than the monomelic secreted protein or protein fragment alone. (Fountoulakis 
et al., J. Biochem. 270:3958-3964 (1995).) 

25 Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion 

proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is beneficial in therapy and diagnosis, and thus can result in, for example, improved 
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pharmacokinetic properties. (EP-A 0232 262.) Alternatively, deleting the Fc part after the 
fusion protein has been expressed, detected, and purified, would be desired. For example, 
the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen 
for immunizations. In drug discovery, for example, human proteins, such as hIL-5, have 
been fused with Fc portions for the purpose of high-throughput screening assays to identify 
antagonists of hIL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. 
Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, such 
as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311), among others, many of which are commercially available. As described in Gentz 
et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides 
for convenient purification of the fusion protein. Another peptide tag useful for 
purification, the "HA" tag, corresponds to an epitope derived from the influenza 
hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides or 
the polypeptides of the present invention. 



Vectors. Host Cells, and Protein Prnrinrtinn 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral vector. 
Retroviral vectors may be replication competent or replication defective. In the latter case, 
viral propagation generally will occur only in complementing host cells. 
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The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a 
calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a 
virus, it may be packaged in vitro using an appropriate packaging cell line and then 
5 transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. Other 
suitable promoters will be known to the skilled artisan. The expression constructs will 

10 further contain sites for transcription initiation, termination, and, in the transcribed region, 
a ribosome binding site for translation. The coding portion of the transcripts expressed by 
the constructs will preferably include a translation initiating codon at the beginning and a 
termination codon (UAA, UGA or UAG) appropriately positioned at the end of the 
polypeptide to be translated. 

15 As indicated, the expression vectors will preferably include at least one selectable 

marker. Such markers include dihydrofolate reductase, G418 or neomycin resistance for 
eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance genes for 
culturing in E. coli and other bacteria. Representative examples of appropriate hosts 
include, but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella 

20 typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, 293, and Bowes melanoma cells; 
and plant cells. Appropriate culture mediums and conditions for the above-described host 
cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
25 available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, pNH16a, 
pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and ptrc99a, 
pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, Inc. Among 
preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available 
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from Stratagene; and pSVK3, pBPV, pMSG and pS VL available from Pharmacia. Other 
suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection, or other methods. Such methods are described in 
many standard laboratory manuals, such as Davis et al., Basic Methods In Molecular 
Biology (1986). It is specifically contemplated that the polypeptides of the present 
invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, 
acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention, and preferably the secreted form, can also be 
recovered from: products purified from natural sources, including bodily fluids, tissues 
and cells, whether directly isolated or cultured; products of chemical synthetic procedures; 
and products produced by recombinant techniques from a prokaryotic or eukaryotic host, 
including, for example, bacterial, yeast, higher plant, insect, and mammalian cells. 
Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In 
addition, polypeptides of the invention may also include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. Thus, it is well known in the 
art that the N-terminal methionine encoded by the translation initiation codon generally is 
removed with high efficiency from any protein after translation in all eukaryotic cells. 
While the N-terminal methionine on most proteins also is efficiently removed in most 
prokaryotes, for some proteins, this prokaryotic removal process is inefficient, depending 
on the nature of the amino acid to which the N-terminal methionine is covalently linked. 
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In addition to encompassing host cells containing the vector constructs discussed 
herein, the invention also encompasses primary, secondary, and immortalized host cells of 
vertebrate origin, particularly mammalian origin, that have been engineered to delete or 
replace endogenous genetic material (e.g., coding sequence), and/or to include genetic 
5 material (e.g., heterologous polynucleotide sequences) that is operably associated with the 
polynucleotides of the invention, and which activates, alters, and/or amplifies endogenous 
polynucleotides. For example, techniques known in the art may be used to operably 
associate heterologous control regions (e.g., promoter and/or enhancer) and endogenous 
polynucleotide sequences via homologous recombination (see, e.g., U.S. Patent No. 
10 5,641,670, issued June 24, 1997; International Publication No. WO 96/2941 1, published 
September 26, 1996; International Publication No. WO 94/12650, published August 4, 
1994; Roller et al., Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., 
Nature 342:435-438 (1989), the disclosures of each of which are incorporated by reference 
in their entireties). 

1 5 In addition, polypeptides of the invention can be chemically synthesized using 

techniques known in the art (e.g., see Creighton, 1983, Proteins: Structures and Molecular 
Principles, W.H. Freeman & Co., N.Y., and Hunkapilleret al, Nature, 310:105-111 
(1984)). For example, a polypeptide corresponding to a fragment of a polypeptide 
sequence of the invention can be synthesized by use of a peptide synthesizer. Furthermore, 

20 if desired, nonclassical amino acids or chemical amino acid analogs can be introduced as a 
substitution or addition into the polypeptide sequence. Non-classical amino acids include, 
but are not limited to, to the D-isomers of the common amino acids, 2,4-diaminobutyric 
acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, 
e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, 

25 ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, 
cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, b-alanine, 
fluoro-amino acids, designer amino acids such as b-methyl amino acids, Ca-methyl amino 
acids, Na-methyl amino acids, and amino acid analogs in general. Furthermore, the amino 
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acid can be D (dextrorotary) or L (levorotary). 

The invention encompasses polypeptides which are differentially modified during 
or after translation, e.g., by glycosylate, acetylation, phosphorylation, amidation, 
derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an 
antibody molecule or other cellular ligand, etc. Any of numerous chemical modifications 
may be carried out by known techniques, including but not limited, to specific chemical 
cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH 4 ; 
acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; etc. 

Additional post-translational modifications encompassed by the invention include, 
for example, e.g., N-linked or O-linked carbohydrate chains, processing of N-terminal or 
C-terminal ends), attachment of chemical moieties to the amino acid backbone, chemical 
modifications of N-linked or O-linked carbohydrate chains, and addition or deletion of an 
N-terminal methionine residue as a result of procaryotic host cell expression. The 
polypeptides may also be modified with a detectable label, such as an enzymatic, 
fluorescent, isotopic or affinity label to allow for detection and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the 
polypeptides of the invention which may provide additional advantages such as increased 
solubility, stability and circulating time of the polypeptide, or decreased immunogenicity 
(see U.S. Patent NO: 4,179,337). The chemical moieties for derivitization may be selected 
from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol 
copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The 
polypeptides may be modified at random positions within the molecule, or at 
predetermined positions within the molecule and may include one, two, three or more 
attached chemical moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. 
For polyethylene glycol, the preferred molecular weight is between about I kDa and about 
100 kDa (the term "about" indicating that in preparations of polyethylene glycol, some 

SUBSTITUTE SHEET (RULE 26) 



W0 00/29435 PCT/US99/25031 

377 

molecules will weigh more, some less, than the stated molecular weight) for ease in 
handling and manufacturing. Other sizes may be used, depending on the desired 
therapeutic profile (e.g., the duration of sustained release desired, the effects, if any on 
biological activity, the ease in handling, the degree or lack of antigenicity and other known 
effects of the polyethylene glycol to a therapeutic protein or analog). 

The polyethylene glycol molecules (or other chemical moieties) should be attached 
to the protein with consideration of effects on functional or antigenic domains of the 
protein. There are a number of attachment methods available to those skilled in the art, 
e.g., EP 0 401 384, herein incorporated by reference (coupling PEG to G-CSF), see also 
Malik et al., Exp. Hematol. 20:1028-1035 (1992) (reporting pegylation of GM-CSF using 
tresyl chloride). For example, polyethylene glycol may be covalently bound through 
amino acid residues via a reactive group, such as, a free amino or carboxyl group. 
Reactive groups are those to which an activated polyethylene glycol molecule may be 
bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include 
aspartic acid residues glutamic acid residues and the C-terminal amino acid residue. 
Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene 
glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such 
as attachment at the N-terminus or lysine group. 

One may specifically desire proteins chemically modified at the N-terminus. Using 
polyethylene glycol as an illustration of the present composition, one may select from a 
variety of polyethylene glycol molecules (by molecular weight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein (polypeptide) molecules in the 
reaction mix, the type of pegylation reaction to be performed, and the method of obtaining 
the selected N-terminally pegylated protein. The method of obtaining the N-terminally 
pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a 
population of pegylated protein molecules. Selective proteins chemically modified at the 
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N-terminus modification may be accomplished by reductive alkylation which exploits 
differential reactivity of different types of primary amino groups (lysine versus the 
N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus 
with a carbonyl group containing polymer is achieved. 

The polypeptides of the invention may be in monomers or multimers (i.e., dimers, 
trimers, tetramers and higher multimers). Accordingly, the present invention relates to 
monomers and multimers of the polypeptides of the invention, their preparation, and 
compositions (preferably, Therapeutics) containing them. In specific embodiments, the 
polypeptides of the invention are monomers, dimers, trimers or tetramers. In additional 
embodiments, the multimers of the invention are at least dimers, at least trimers, or at least 
tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As 
used herein, the term homomer, refers to a multimer containing only polypeptides 
corresponding to the amino acid sequence of SEQ ID NO: Y or encoded by the cDNA 
contained in a deposited clone (including fragments, variants, splice variants, and fusion 
proteins, corresponding to these polypeptides as described herein). These homomers may 
contain polypeptides having identical or different amino acid sequences. In a specific 
embodiment, a homomer of the invention is a multimer containing only polypeptides 
having an identical amino acid sequence. In another specific embodiment, a homomer of 
the invention is a multimer containing polypeptides having different amino acid sequences. 
In specific embodiments, the multimer of the invention is a homodimer (e.g., containing 
polypeptides having identical or different amino acid sequences) or a homotrimer (e.g., 
containing polypeptides having identical and/or different amino acid sequences). In 
additional embodiments, the homomeric multimer of the invention is at least a homodimer. 
at least a homotrimer, or at least a homotetramer. 

As used herein, the term heteromer refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the 
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polypeptides of the invention. In a specific embodiment, the multimer of the invention is a 
heterodimer, a heterotrimer, or a heterotetramer. In additional embodiments, the 
heteromeric multimer of the invention is at least a heterodimer, at least a heterotrimer, or at 
least a heterotetramer. 

5 Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic 

and/or covalent associations and/or may be indirectly linked, by for example, liposome 
formation. Thus, in one embodiment, multimers of the invention, such as, for example, 
homodimers or homotrimers, are formed when polypeptides of the invention contact one 
another in solution. In another embodiment, heteromultimers of the invention, such as, for 

10 example, heterotrimers or heterotetramers, are formed when polypeptides of the invention 
contact antibodies to the polypeptides of the invention (including antibodies to the 
heterologous polypeptide sequence in a fusion protein of the invention) in solution. In 
other embodiments, multimers of the invention are formed by covalent associations with 
and/or between the polypeptides of the invention. Such covalent associations may involve 

15 one or more amino acid residues contained in the polypeptide sequence ( e.g., that recited 
in the sequence listing, or contained in the polypeptide encoded by a deposited clone). In 
one instance, the covalent associations are cross-linking between cysteine residues located 
within the polypeptide sequences which interact in the native (i.e., naturally occurring) 
polypeptide. In another instance, the covalent associations are the consequence of 

20 chemical or recombinant manipulation. Alternatively, such covalent associations may 
involve one or more amino acid residues contained in the heterologous polypeptide 
sequence in a fusion protein of the invention. 

In one example, covalent associations are between the heterologous sequence 
contained in a fusion protein of the invention (see, e.g., US Patent Number 5,478,925). In 

25 a specific example, the covalent associations are between the heterologous sequence 

contained in an Fc fusion protein of the invention (as described herein). In another specific 
example, covalent associations of fusion proteins of the invention are between 
heterologous polypeptide sequence from another protein that is capable of forming 
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covalently associated multimers, such as for example, oseteoprotegerin (see, e.g., 
International Publication NO: WO 98/49305, the contents of which are herein incorporated 
by reference in its entirety). In another embodiment, two or more polypeptides of the 
invention are joined through peptide linkers. Examples include those peptide linkers 
described in U.S. Pat. No. 5,073,627 (hereby incorporated by reference). Proteins 
comprising multiple polypeptides of the invention separated by peptide linkers may be 
produced using conventional recombinant DNA technology. 

Another method for preparing multimer polypeptides of the invention involves use 
of polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide 
sequence. Leucine zipper and isoleucine zipper domains are polypeptides that promote 
multimerization of the proteins in which they are found. Leucine zippers were originally 
identified in several DNA-binding proteins (Landschulz et al., Science 240:1759, (1988)), 
and have since been found in a variety of different proteins. Among the known leucine 
zippers are naturally occurring peptides and derivatives thereof that dimerize or trimerize. 
Examples of leucine zipper domains suitable for producing soluble multimeric proteins of 
the invention are those described in PCT application WO 94/10308, hereby incorporated 
by reference. Recombinant fusion proteins comprising a polypeptide of the invention 
fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in 
suitable host cells, and the resulting soluble multimeric fusion protein is recovered from 
the culture supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced 
biological activity. Preferred leucine zipper moieties and isoleucine moieties are those that 
preferentially form trimers. One example is a leucine zipper derived from lung surfactant 
protein D (SPD), as described in Hoppe et al. (FEBS Letters 344:191, (1994)) and in U.S. 
patent application Ser. No. 08/446,922, hereby incorporated by reference. Other peptides 
derived from naturally occurring trimeric proteins may be employed in preparing trimeric 
polypeptides of the invention. 
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In another example, proteins of the invention are associated by interactions 
between Flag® polypeptide sequence contained in fusion proteins of the invention 
containing Flag® polypeptide seuqence. In a further embodiment, associations proteins of 
the invention are associated by interactions between heterologous polypeptide sequence 
contained in Flag® fusion proteins of the invention and anti-Flag® antibody. 

The multimers of the invention may be generated using chemical techniques known 
in the art. For example, polypeptides desired to be contained in the multimers of the 
invention may be chemically cross-linked using linker molecules and linker molecule 
length optimization techniques known in the art (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). Additionally, multimers of the 
invention may be generated using techniques known in the art to form one or more inter- 
molecule cross-links between the cysteine residues located within the sequence of the 
polypeptides desired to be contained in the multimer (see, e.g., US Patent Number 
5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides 
of the invention may be routinely modified by the addition of cysteine or biotin to the C 
terminus or N-terminus of the polypeptide and techniques known in the art may be applied 
to generate multimers containing one or more of these modified polypeptides (see, e.g., US 
Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 
Additionally, techniques known in the art may be applied to generate liposomes containing 
the polypeptide components desired to be contained in the multimer of the invention (see, 
e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its 
entirety). 

Alternatively, multimers of the invention may be generated using genetic 
engineering techniques known in the art. In one embodiment, polypeptides contained in 
multimers of the invention are produced recombinantly using fusion protein technology 
described herein or otherwise known in the art (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). In a specific embodiment, 
polynucleotides coding for a homodimer of the invention are generated by ligating a 
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polynucleotide sequence encoding a polypeptide of the invention to a sequence encoding a 
linker polypeptide and then further to a synthetic polynucleotide encoding the translated 
product of the polypeptide in the reverse orientation from the original C-terminus to the N- 
terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925, which is 
herein incorporated by reference in its entirety). In another embodiment, recombinant 
techniques described herein or otherwise known in the art are applied to generate 
recombinant polypeptides of the invention which contain a transmembrane domain (or 
hyrophobic or signal peptide) and which can be incorporated by membrane reconstitution 
techniques into liposomes (see, e.g., US Patent Number 5,478,925, which is herein 
incorporated by reference in its entirety). 

Uses of the Polvniirl»n^Pc 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes known 
techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, since 
few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention can 
be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted exon 
in the genomic DNA. These primers are then used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids containing the 
human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per day 
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using a single thermal cycler. Moreover, sublocalization of the polynucleotides can be 
achieved with panels of specific chromosome fragments. Other gene mapping strategies 
that can be used include in situ hybridization, prescreening with labeled flow-sorted 
chromosomes, and preselection by hybridization to construct chromosome specific-cDNA 
5 libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., "Human 

10 Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to mark a 
single chromosome or a single site on that chromosome) or in panels (for marking multiple 
sites and/or multiple chromosomes). Preferred polynucleotides correspond to the 
noncoding regions of the cDNAs because the coding sequences are more likely conserved 

15 within gene families, thus increasing the chance of cross hybridization during 
chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage analysis 
establishes coinheritance between a chromosomal location and presentation of a particular 

20 disease. (Disease mapping data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins University Welch Medical 
Library) .) Assuming 1 megabase mapping resolution and one gene per 20 kb, a cDNA 
precisely localized to a chromosomal region associated with the disease could be one of 
50-500 potential causative genes. 

25 Thus, once coinheritance is established, differences in the polynucleotide and the 

corresponding gene between affected and unaffected individuals can be examined. First, 
visible structural alterations in the chromosomes, such as deletions or translocations, are 
examined in chromosome spreads or by PCR. If no structural alterations exist, the 
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presence of point mutations are ascertained. Mutations observed in some or all affected 
individuals, but not in normal individuals, indicates that the mutation may cause the 
disease. However, complete sequencing of the polypeptide and the corresponding gene 
from several normal individuals is required to distinguish the mutation from a 
polymorphism. If a new polymorphism is identified, this polymorphic polypeptide can be 
used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected individuals 
as compared to unaffected individuals can be assessed using polynucleotides of the present 
invention. Any of these alterations (altered expression, chromosomal rearrangement, or 
mutation) can be used as a diagnostic or prognostic marker. 

Thus, the invention also provides a diagnostic method useful during diagnosis of a 
disorder, involving measuring the expression level of polynucleotides of the present 
invention in cells or body fluid from an individual and comparing the measured gene 
expression level with a standard level of polynucleotide expression level, whereby an 
increase or decrease in the gene expression level compared to the standard is indicative of 
a disorder. 

In still another embodiment, the invention includes a kit for analyzing samples for 
the presence of proliferative and/or cancerous polynucleotides derived from a test subject. 
In a general embodiment, the kit includes at least one polynucleotide probe containing a 
nucleotide sequence that will specifically hybridize with a polynucleotide of the present 
invention and a suitable container. In a specific embodiment, the kit includes two 
polynucleotide probes defining an internal region of the polynucleotide of the present 
invention, where each probe has one strand containing a 31'mer-end internal to the region. 
In a further embodiment, the probes may be useful as primers for polymerase chain 
reaction amplification. 

Where a diagnosis of a disorder, has already been made according to conventional 
methods, the present invention is useful as a prognostic indicator, whereby patients 
exhibiting enhanced or depressed polynucleotide of the present invention expression will 
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experience a worse clinical outcome relative to patients expressing the gene at a level 
nearer the standard level. 

By "measuring the expression level of polynucleotide of the present invention" is 
intended qualitatively or quantitatively measuring or estimating the level of the 
polypeptide of the present invention or the level of the mRNA encoding the polypeptide in 
a first biological sample either directly (e.g., by determining or estimating absolute protein 
level or mRNA level) or relatively (e.g., by comparing to the polypeptide level or mRNA 
level in a second biological sample). Preferably, the polypeptide level or mRNA level in 
the first biological sample is measured or estimated and compared to a standard 
polypeptide level or mRNA level, the standard being taken from a second biological 
sample obtained from an individual not having the disorder or being determined by 
averaging levels from a population of individuals not having a disorder. As will be 
appreciated in the art, once a standard polypeptide level or mRNA level is known, it can be 
used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from an 
individual, body fluid, cell line, tissue culture, or other source which contains the 
polypeptide of the present invention or mRNA. As indicated, biological samples include 
body fluids (such as semen, lymph, sera, plasma, urine, synovial fluid and spinal fluid) 
which contain the polypeptide of the present invention, and other tissue sources found to 
express the polypeptide of the present invention. Methods for obtaining tissue biopsies 
and body fluids from mammals are well known in the art. Where the biological sample is 
to include mRNA, a tissue biopsy is the preferred source. 

The method(s) provided above may preferrably be applied in a diagnostic method 
and/or kits in which polynucleotides and/or polypeptides are attached to a solid support. In 
one exemplary method, the support may be a "gene chip" or a "biological chip" as 
described in US Patents 5,837,832, 5,874,219, and 5,856,174. Further, such a gene chip 
with polynucleotides of the present invention attached may be used to identify 
polymorphisms between the polynucleotide sequences, with polynucleotides isolated from 
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a test subject. The knowledge of such polymorphisms (i.e. their location, as well as, their 
existence) would be beneficial in identifying disease loci for many disorders, including 
cancerous diseases and conditions. Such a method is described in US Patents 5,858,659 
and 5,856,104. The US Patents referenced supra are hereby incorporated by reference in 
5 their entirety herein. 

The present invention encompasses polynucleotides of the present invention that 
are chemically synthesized, or reproduced as peptide nucleic acids (PNA), or according to 
other methods known in the art. The use of PNAs would serve as the preferred form if the 
polynucleotides are incorporated onto a solid support, or gene chip. For the purposes of the 
JO present invention, a peptide nucleic acid (PNA) is a polyamide type of DNA analog and 
the monomeric units for adenine, guanine, thymine and cytosine are available 
commercially (Perceptive Biosystems). Certain components of DNA, such as phosphorus, 
phosphorus oxides, or deoxyribose derivatives, are not present in PNAs. As disclosed by P. 
E. Nielsen, M. Egholm, R. H. Berg and O. Buchardt, Science 254, J497 (1991); and M. 
15 Egholm, O. Buchardt, L.Chri S tensen, C. Behrens, S. M. Freier, D. A. Driver, R. H. Berg, 
S. K. Kim, B. Norden, and P. E. Nielsen, Nature 365, 666 (1993), PNAs bind specifically 
and tightly to complementary DNA strands and are not degraded by nucleases. In fact, 
PNA binds more strongly to DNA than DNA itself does. This is probably because there is 
no electrostatic repulsion between the two strands, and also the polyamide backbone is 
20 more flexible. Because of this, PNA/DNA duplexes bind under a wider range of stringency 
conditions than DNA/DNA duplexes, making it easier to perform multiplex hybridization. 
Smaller probes can be used than with DNA due to the strong binding. In addition, it is 
more likely that single base mismatches can be determined with PNA/DNA hybridization 
because a single mismatch in a PNA/DNA 15-mer lowers the melting point (T.sub.m) by 
25 8«-20° C, vs. 4M6° C for the DNA/DNA 15-mer duplex. Also, the absence of charge 
groups in PNA means that hybridization can be done at low ionic strengths and reduce 
possible interference by salt during the analysis. 
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The present invention is useful for detecting cancer in mammals. In particular the 
invention is useful during diagnosis of pathological cell proliferative neoplasias which 
include, but are not limited to: acute myelogenous leukemias including acute monocytic 
leukemia, acute .myeloblasts leukemia, acute promyelocytic leukemia, acute 
myelomonocytic leukemia, acute erythroleukemia, acute megakaryocyte leukemia, and 
acute undifferentiated leukemia, etc.; and chronic myelogenous leukemias including 
chronic myelomonocytic leukemia, chronic granulocytic leukemia, etc. Preferred 
mammals include monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. 
Particularly preferred are humans. 

Pathological cell proliferative disorders are often associated with inappropriate 
activation of proto-oncogenes. (Gelmann, E. P. et al., "The Etiology of Acute Leukemia: 
Molecular Genetics and Viral Oncology," in Neoplastic Diseases of the Blood, Vol 1., 
Wiemik, P. H. et al. eds., 161-182 (1985)). Neoplasias are now believed to result from the 
qualitative alteration of a normal cellular gene product, or from the quantitative 
modification of gene expression by insertion into the chromosome of a viral sequence, by 
chromosomal translocation of a gene to a more actively transcribed region, or by some 
other mechanism. (Gelmann et al., supra) It is likely that mutated or altered expression of 
specific genes is involved in the pathogenesis of some leukemias, among other tissues and 
cell types. (Gelmann et al., supra) Indeed, the human counterparts of the oncogenes 
involved in some animal neoplasias have been amplified or translocated in some cases of 
human leukemia and carcinoma. (Gelmann et al., supra) 

For example, c-myc expression is highly amplified in the non-lymphocytic leukemia cell 
line HL-60. When HL-60 cells are chemically induced to stop proliferation, the level of c- 
myc is found to be downregulated. (International Publication Number WO 91/15580) 
However, it has been shown that exposure of HL-60 cells to a DNA construct that is 
complementary to the 5' end of c-myc or c-myb blocks translation of the corresponding 
mRNAs which downregulates expression of the c-myc or c-myb proteins and causes arrest 
of cell proliferation and differentiation of the treated cells. (International Publication 
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Number WO 91/15580; Wickstrom et al., Proc. Natl. Acad. Sci. 85:1028 (1988); Anfossi 
et al., Proc. Natl. Acad. Sci. 86:3379 (1989)). However, the skilled artisan would 
appreciate the present invention's usefulness would not be limited to treatment of 
proliferative disorders of hematopoietic cells and tissues, in light of the numerous cells and 
5 cell types of varying origins which are known to exhibit proliferative phenotypes. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Antisense techniques 
are discussed, for example, in Okano, J. Neurochem. 56: 560 (1991); 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression,CRCPress, Boca 

10 Raton, FL (1988). Triple helix formation is discussed in, for instance Lee et al., Nucleic 
Acids Research 6: 3073 (1979); Cooney et al., Science 241: 456 (1988); and Dervan et al., 
Science 251: 1360 (1991). Both methods rely on binding of the polynucleotide to a 
complementary DNA or RNA. For these techniques, preferred polynucleotides are usually 
oligonucleotides 20 to 40 bases in length and complementary to either the region of the 

15 gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); 
Cooney et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991) ) or to 
the mRNA itself (antisense - Okano, J. Neurochem. 56:560 (1991); Oligodeoxy- 
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is to insert a new gene that was not present in the host genome, thereby producing a new 
trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying personnel. 
This method does not suffer from the current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification difficult. The polynucleotides of the 
present invention can be used as additional DNA markers for RFLP. 

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for amplifying 
and isolating such selected DNA, which can then be sequenced. Using this technique, 
individuals can be identified because each individual will have a unique set of DNA 
sequences. Once an unique ID database is established for an individual, positive 
identification of that individual, living or dead, can be made from extremely small tissue 
samples. 

Forensic biology also benefits from using DNA-based identification techniques as 
disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, synovial fluid, amniotic 
fluid, breast milk, lymph, pulmonary sputum or surfactant,urine,fecal matter, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to identify 
individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once these specific 
polymorphic loci are amplified, they are digested with one or more restriction enzymes, 
yielding an identifying set of bands on a Southern blot probed with DNA corresponding to 
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the DQa class II HLA gene. Similarly, polynucleotides of the present invention can be 
used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of unknown 
5 origin. Appropriate reagents can comprise, for example, DNA probes or primers specific to 
particular tissue prepared from the sequences of the present invention. Panels of such 
reagents can identify tissue by species and/or by organ type. In a similar fashion, these 
reagents can be used to screen tissue cultures for contamination. 

In the very least, the polynucleotides of the present invention can be used as 
10 molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences in 
the process of discovering novel polynucleotides, for selecting and making oligomers for 
attachment to a "gene chip" or other support, to raise anti-DNA antibodies using DNA 
immunization techniques, and as an antigen to elicit an immune response. 

15 

Uses of the Polypeptide 

Each of the polypeptides identified herein can be used in numerous ways. The 

following description should be considered exemplary and utilizes known techniques. 
A polypeptide of the present invention can be used to assay protein levels in a 
20 biological sample using antibody-based techniques. For example, protein expression in 

tissues can be studied with classical immunohistological methods. (Jalkanen, M., et a!., J. 

Cell. Biol. 101:976-985 (1985); Jalkanen, M, et al., J. Cell . Biol. 105:3087-3096 (1987).) 

Other antibody-based methods useful for detecting protein gene expression include 

immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and the 
25 radioimmunoassay (RIA). Suitable antibody assay labels are known in the art and include 

enzyme labels, such as, glucose oxidase, and radioisotopes, such as iodine (1251, 1211), 

carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and technetium (99mTc), and 

fluorescent labels, such as fluorescein and rhodamine, and biotin. 
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In addition to assaying secreted protein levels in a biological sample, proteins can 
also be detected in vivo by imaging. Antibody labels or markers for in vivo imaging of 
protein include those detectable by X-radiography, NMR or ESR. For X-radiography, 
suitable labels include radioisotopes such as barium or cesium, which emit detectable 
radiation but are not overtly harmful to the subject. Suitable markers for NMR and ESR 
include those with a detectable characteristic spin, such as deuterium, which may be 
incorporated into the antibody by labeling of nutrients for the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with an 
appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 1 121n, 
99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic resonance, 
is introduced (for example, parenterally, subcutaneously, or intraperitoneally) into the 
mammal. It will be understood in the art that the size of the subject and the imaging 
system used will determine the quantity of imaging moiety needed to produce diagnostic 
images. In the case of a radioisotope moiety, for a human subject, the quantity of 
radioactivity injected will normally range from about 5 to 20 millicuries of 99mTc. The 
labeled antibody or antibody fragment will then preferentially accumulate at the location of 
cells which contain the specific protein. In vivo tumor imaging is described in S.W. 
Burchiel et al., "Immunopharmacokinetics of Radiolabeled Antibodies and Their 
Fragments." (Chapter 13 in Tumor Imaging: The Radiochemical Detection of Cancer, 
S.W. Burchiel and B. A. Rhodes, eds., Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves (a) 
assaying the expression of a polypeptide of the present invention in cells or body fluid of 
an individual; (b) comparing the level of gene expression with a standard gene expression 
level, whereby an increase or decrease in the assayed polypeptide gene expression level 
compared to the standard expression level is indicative of a disorder. With respect to 
cancer, the presence of a relatively high amount of transcript in biopsied tissue from an 
individual may indicate a predisposition for the development of the disease, or may 
provide a means for detecting the disease prior to the appearance of actual clinical 
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symptoms. A more definitive diagnosis of this type may allow health professionals to 
employ preventative measures or aggressive treatment earlier thereby preventing the 
development or further progression of the cancer. 

Moreover, polypeptides of the present invention can be used to treat disease. For 
example, patients can be administered a polypeptide of the present invention in an effort 
to replace absent or decreased levels of the polypeptide (e.g., insulin), to supplement 
absent or decreased levels of a different polypeptide (e.g., hemoglobin S for hemoglobin B, 
, SOD, catalase, DNA repair proteins), to inhibit the activity of a polypeptide (e.g., an 
oncogene or tumor supressor), to activate the activity of a polypeptide (e.g., by binding to a 
receptor), to reduce the activity of a membrane bound receptor by competing with it for 
free ligand (e.g., soluble TNF receptors used in reducing inflammation), or to bring about 
a desired response (e.g., blood vessel growth inhibition, enhancement of the immune 
response to proliferative cells or tissues). 

Similarly, antibodies directed to a polypeptide of the present invention can also be 
used to treat disease. For example, administration of an antibody directed to a polypeptide 
of the present invention can bind and reduce overproduction of the polypeptide. Similarly, 
administration of an antibody can activate the polypeptide, such as by binding to a 
polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. Polypeptides can also be used to 
raise antibodies, which in turn are used to measure protein expression from a recombinant 
cell, as a way of assessing transformation of the host cell. Moreover, the polypeptides of 
the present invention can be used to test the following biological activities. 
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Gene Therapy MptfinH« 

Another aspect of the present invention is to gene therapy methods for treating 
disorders, diseases and conditions. The gene therapy methods relate to the introduction of 
nucleic acid (DNA, RNA and antisense DNA or RNA) sequences into an animal to achieve 
expression of a polypeptide of the present invention. This method requires a 
polynucleotide which codes for a polypeptide of the invention that operatively linked to a 
promoter and any other genetic elements necessary for the expression of the polypeptide 
by the target tissue. Such gene therapy and delivery techniques are known in the art, see, 
for example, WO90/1 1092, which is herein incorporated by reference. 

Thus, for example, cells from a patient may be engineered with a polynucleotide 
(DNA or RNA) comprising a promoter operably linked to a polynucleotide of the 
invention ex vivo, with the engineered cells then being provided to a patient to be treated 
with the polypeptide. Such methods are well-known in the art. For example, see 
Belldegrun et al., J. Natl. Cancer Inst., 85:207-216 (1993): Ferrantini et al., Cancer 
Research, 53:107-1112 (1993); Ferrantini et al., J. Immunology 153: 4604-4615 (1994); 
Kaido, T., et al., Int. J. Cancer 60: 221-229 (1995); Ogura et al., Cancer Research 50: 
5102-5106 (1990); Santodonato, et al., Human Gene Therapy 7:1-10 (1996); Santodonato, 
et al., Gene Therapy 4:1246-1255 (1997); and Zhang, et al., Cancer Gene Therapy 3: 31-38 
(1996)), which are herein incorporated by reference. In one embodiment, the cells which 
are engineered are arterial cells. The arterial cells may be reintroduced into the patient 
through direct injection to the artery, the tissues surrounding the artery, or through catheter 
injection. 

As discussed in more detail below, the polynucleotide constructs can be delivered 
by any method that delivers injectable materials to the cells of an animal, such as, injection 
into the interstitial space of tissues (heart, muscle, skin, lung, liver, and the like). The 
polynucleotide constructs may be delivered in a pharmaceutical^ acceptable liquid or 
aqueous carrier. 
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In one embodiment, the polynucleotide of the invention is delivered as a naked 
polynucleotide. The term "naked" polynucleotide, DNA or RNA refers to sequences that 
are free from any delivery vehicle that acts to assist, promote or facilitate entry into the 
cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
5 precipitating agents and the like. However, the polynucleotides of the invention can also 
be delivered in liposome formulations and lipofectin formulations and the like can be 
prepared by methods well known to those skilled in the art. Such methods are described, 
for example, in U.S. Patent Nos. 5,593,972, 5,589,466, and 5,580,859, which are herein 
incorporated by reference. 

10 The polynucleotide vector constructs of the invention used in the gene therapy 

method are preferably constructs that will not integrate into the host genome nor will they 
contain sequences that allow for replication. Appropriate vectors include pWLNEO, 
PSV2CAT, pOG44, pXTl and pSG available from Stratagene; P SVK3, pBPV, pMSG and 
pSVL available from Pharmacia; and P EF1/V5, pcDNA3.1, and pRc/CMV2 available from 
15 Invitrogen. Other suitable vectors will be readily apparent to the skilled artisan. 

Any strong promoter known to those skilled in the art can be used for driving the 
expression of polynucleotide sequence of the invention. Suitable promoters include 
adenoviral promoters, such as the adenoviral major late promoter; or heterologous 
promoters, such as the cytomegalovirus (CMV) promoter, the respiratory syncytial virus 
20 (RS V) promoter; inducible promoters, such as the MMT promoter, the metallothionein 
promoter; heat shock promoters; the albumin promoter, the ApoAI promoter; human 
globin promoters; viral thymidine kinase promoters, such as the Herpes Simplex thymidine 
kinase promoter, retroviral LTRs; the b-actin promoter, and human growth hormone 
promoters. The promoter also may be the native promoter for the polynucleotides of the 
25 invention. 

Unlike other gene therapy techniques, one major advantage of introducing naked 
nucleic acid sequences into target cells is the transitory nature of the polynucleotide 
synthesis in the cells. Studies have shown that non-replicating DNA sequences can be 
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introduced into cells to provide production of the desired polypeptide for periods of up to 
six months. 

The polynucleotide construct of the invention can be delivered to the interstitial 
space of tissues within the an animal, including of muscle, skin, brain, lung, liver, spleen, 
5 bone marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective 
tissue. Interstitial space of the tissues comprises the intercellular, fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue 
10 ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by the 
plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to the 
interstitial space of muscle tissue is preferred for the reasons discussed below. They may be 
conveniently delivered by injection into the tissues comprising these cells. They are 
preferably delivered to and expressed in persistent, non-dividing cells which are 
15 differentiated, although delivery and expression may be achieved in non-differentiated or 
less completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and 
express polynucleotides. 

For the nakednuc/eic acid sequence injection, an effective dosage amount of DNA 
20 or RN A will be in the range of from about 0.05 mg/kg body weight to about 50 mg/kg body 
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and more 
preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of ordinary 
skill will appreciate, this dosage will vary according to the tissue site of injection. The 
appropriate and effective dosage of nucleic acid sequence can readily be determined by 
25 those of ordinary skill in the art and may depend on the condition being treated and the 
route of administration. 

The preferred route of administration is by the parenteral route of injection into the 
interstitial space of tissues. However, other parenteral routes may also be used, such as, 
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inhalation of an aerosol formulation particularly for delivery to lungs or bronchial tissues, 
throat or mucous membranes of the nose. In addition, naked DNA constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

The naked polynucleotides are delivered by any method known in the art, 
including, but not limited to, direct needle injection at the delivery site, intravenous 
injection, topical administration, catheter infusion, and so-called "gene guns". These 
delivery methods are known in the art. 

The constructs may also be delivered with delivery vehicles such as viral 
sequences, viral particles, liposome formulations, lipofectin, precipitating agents, etc. Such 
methods of delivery are known in the art. 

In certain embodiments, the polynucleotide constructs of the invention are 
complexed in a liposome preparation. Liposomal preparations for use in the instant 
invention include cationic (positively charged), anionic (negatively charged) and neutral 
preparations. However, cationic liposomes are particularly preferred because a tight charge 
complex can be formed between the cationic liposome and the polyanionic nucleic acid. 
Cationic liposomes have been shown to mediate intracellular delivery of plasmid DNA 
(Feigner et al., Proc. Natl. Acad. Sci. USA , 84:7413-7416 (1987), which is herein 
incorporated by reference); mRNA (Malone et al., Proc. Natl. Acad. Sci. USA , 
,86:6077-6081 (1989), which is herein incorporated by reference); and purified 
transcription factors (Debs et al., J. Biol. Chem., 265:10189-10192 (1990), which is herein 
incorporated by reference), in functional form. 

Cationic liposomes are readily available. For example, 
N[I-2,3-dioleyloxy)propyl]-N^ ) N-triethylammonium (DOTMA) liposomes are 
particularly useful and are available under the trademark Lipofectin, from GIBCO BRL, 
Grand Island, N.Y. (See, also, Feigner et al., Proc. Natl Acad. Sci. USA , 84:7413-7416 
(1987), which is herein incorporated by reference). Other commercially available 
liposomes include transferee (DDAB/DOPE) and DOTAP/DOPE (Boehringer). 
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Other cationic liposomes can be prepared from readily available materials using 
techniques well known in the ait. See, e.g. PCT Publication NO: WO 90/1 1092 (which is 
herein incorporated by reference) for a description of the synthesis of DOTAP (1,2- 
bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. Preparation of DOTMA 
liposomes is explained in the literature, see, e.g., Feigner et al., Proc. Natl. Acad. Sci. 
USA, 84:7413-7417, which is herein incorporated by reference. Similar methods can be 
used to prepare liposomes from other cationic lipid materials. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, Ala.), or can be easily prepared using readily available 
materials. Such materials include phosphatidyl, choline, cholesterol, phosphatidyl 
ethanolamine, dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol 
(DOPG), dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can 
also be mixed with the DOTMA and DOTAP starting materials in appropriate ratios. 
Methods for making liposomes using these materials are well known in the art. 

For example, commercially dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) 
can be used in various combinations to make conventional liposomes, with or without the 
addition of cholesterol. Thus, for example, DOPG/DOPC vesicles can be prepared by 
drying 50 mg each of DOPG and DOPC under a stream of nitrogen gas into a sonication 
vial. The sample is placed under a vacuum pump overnight and is hydrated the following 
day with deionized water. The sample is then sonicated for 2 hours in a capped vial, using 
a Heat Systems model 350 sonicator equipped with an inverted cup (bath type) probe at the 
maximum setting while the bath is circulated at 15EC. Alternatively, negatively charged 
vesicles can be prepared without sonication to produce multilamellar vesicles or by 
extrusion through nucleopore membranes to produce unilamellar vesicles of discrete size. 
Other methods are known and available to those of skill in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs), with SUVs being preferred. The 
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various liposome-nucleic acid complexes are prepared using methods well known in the 
art. See, e.g., Straubinger et al., Methods of Immunology , 101:512-527 (1983), which is 
herein incorporated by reference. For example, MLVs containing nucleic acid can be 
prepared by depositing a thin film of phospholipid on the walls of a glass tube and 
subsequently hydrating with a solution of the material to be encapsulated. SUVs are 
prepared by extended sonication of MLVs to produce a homogeneous population of 
unilamellar liposomes. The material to be entrapped is added to a suspension of preformed 
MLVs and then sonicated. When using liposomes containing cationic lipids, the dried lipid 
film is resuspended in an appropriate solution such as sterile water or an isotonic buffer 
solution such as 10 mM Tris/NaCl, sonicated, and then the preformed liposomes are mixed 
directly with the DNA. The liposome and DNA form a very stable complex due to binding 
of the positively charged liposomes to the cationic DNA. SUVs find use with small nucleic 
acid fragments. LUVs are prepared by a number of methods, well known in the art. 
Commonly used methods include Ca 2t -EDTA chelation (Papahadjopoulos et al., Biochim. 
Biophys. Acta, 394:483 (1975); Wilson et a]., Cell , 17:77 (1979)); ether injection (Deamer 
et al., Biochim. Biophys. Acta, 443:629 (1976); Ostro et al., Biochem. Biophys. Res. 
Commun., 76:836 (1977); Fraley et al., Proc. Natl. Acad. Sci. USA, 76:3348 (1979)); 
detergent dialysis (Enoch et al., Proc. Natl. Acad. Sci. USA , 76:145 (1979)); and 
reverse-phase evaporation (REV) (Fraley et al., j. Biol. Chem., 255:10431 (1980); Szoka 
et al., Proc. Natl. Acad. Sci. USA , 75:145 (1978); Schaefer-Ridder et al., Science, 215:166 
(1982)), which are herein incorporated by reference. 

Generally, the ratio of DNA to liposomes will be from about 10:1 to about 1:10. 
Preferably, the ration will be from about 5:1 to about 1:5. More preferably, the radon will 
be about 3:1 to about 1:3. Still more preferably, the ratio will be about 1:1. 

U.S. Patent NO: 5,676,954 (which is herein incorporated by reference) reports on 
the injection of genetic material, complexed with cationic liposomes carriers, into mice. 
U.S. Patent Nos. 4,897,355, 4,946,787, 5,049,386, 5,459,127, 5,589,466, 5,693,622, 
5,580,859, 5,703,055, and international publication NO: WO 94/9469 (which are herein 
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incoiporated by reference) provide cationic lipids for use in transfecting DNA into cells 
and mammals. U.S. Patent Nos. 5,589,466, 5,693,622, 5,580,859, 5,703,055, and 
international publication NO: WO 94/9469 (which are herein incorporated by reference) 
provide methods for delivering DNA-cationic lipid complexes to mammals. 
5 In certain embodiments, cells are engineered, ex vivo or in vivo, using a retroviral 

particle containing RNA which comprises a sequence encoding polypeptides of the 
invention. Retroviruses from which the retroviral plasmid vectors may be derived include, 
but are not limited to, Moloney Murine Leukemia Virus, spleen necrosis virus, Rous 
sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, gibbon ape leukemia virus, 
10 human immunodeficiency virus, Myeloproliferative Sarcoma Virus, and mammary tumor 
virus. 

The retroviral plasmid vector is employed to transduce packaging cell lines to form 
producer cell lines. Examples of packaging cells which may be transfected include, but are 
not limited to, the PE501, PA317, R-2, R-AM, PA12, T19-14X, VT-19-17-H2, RCRE, 

1 5 RCRIP, GP+E-86, GP+en vAml2, and DAN cell lines as described in Miller, Human Gene 
Therapy , 1:5-14 (1990), which is incorporated herein by reference in its entirety. The 
vector may transduce the packaging cells through any means known in the art. Such 
means include, but are not limited to, electroporation, the use of liposomes, and CaPO, 
precipitation. In one alternative, the. retroviral plasmid vector may be encapsulated into a 

20 liposome, or coupled to a lipid, and then administered to a host. 

The producer cell line generates infectious retroviral vector particles which include 
polynucleotide encoding polypeptides of the invention. Such retroviral vector particles 
then may be employed, to transduce eukaryotic cells, either in vitro or in vivo. The 
transduced eukaryotic cells will express polypeptides of the invention. 

25 In certain other embodiments, cells are engineered, ex vivo or in vivo, with 

polynucleotides of the invention contained in an adenovirus vector. Adenovirus can be 
manipulated such that it encodes and expresses polypeptides of the invention, and at the 
same time is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 

PCT/US99/25031 

400 

Adenovirus expression is achieved without integration of the viral DNA into the host cell 
chromosome, thereby alleviating concerns about insertional mutagenesis. Furthermore, 
adenoviruses have been used as live enteric vaccines for many years with an excellent 
safety profile (Schwartzet al., Am. Rev. Respir. Dis., 109:233-238 (1974)). Finally, 
adenovirus mediated gene transfer has been demonstrated in a number of instances 
including transfer of alpha- 1 -antitrypsin and CFTR to the lungs of cotton rats (Rosenfeld et 
al.,Science , 252:431-434 (1991); Rosenfeld et al., Cell, 68:143-155 (1992)). Furthermore, 
extensive studies to attempt to establish adenovirus as a causative agent in human cancer 
were uniformly negative (Green et al. Proc. Natl. Acad. Sci. USA , 76:6606 (1979)). 

Suitable adenoviral vectors useful in the present invention are described, for 
example, in Kozarsky and Wilson, Curr. Opin. Genet. Devel., 3:499-503 (1993); Rosenfeld 
et al., Cell , 68:143-155 (1992); Engelhardt et al., Human Genet. Ther., 4:759-769 (1993); 
Yang et al., Nature Genet., 7:362-369 (1994); Wilson et al., Nature , 365:691-692 (1993); 
and U.S. Patent NO: 5,652,224, which are herein incorporated by reference. For example, 
the adenovirus vector Ad2 is useful and can be grown in human 293 cells. These cells 
contain the El region of adenovirus and constitutively express Ela and Elb, which 
complement the defective adenoviruses by providing the products of the genes deleted 
from the vector. In addition to Ad2, other varieties of adenovirus (e.g., Ad3, Ad5, and 
Ad7) are also useful in the present invention. 

Preferably, the adenoviruses used in the present invention are replication deficient. 
Replication deficient adenoviruses require the aid of a helper virus and/or packaging cell 
line to form infectious particles. The resulting virus is capable of infecting ceils and can 
express a polynucleotide of interest which is operably linked to a promoter, but cannot 
replicate in most cells. Replication deficient adenoviruses may be deleted in one or more 
of all or a portion of the following genes: Ela, El b, E3, E4, E2a, or LI through L5. 

In certain other embodiments, the cells are engineered, ex vivo or in vivo, using an 
adeno-associated virus (AAV). AAVs are naturally occurring defective viruses that 
require helper viruses to produce infectious particles (Muzyczka, Curr. Topics in 
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Microbiol. Immunol., 158:97 (1992)). It is also one of the few viruses that may integrate 
its DNA into non-dividing cells. Vectors containing as little as 300 base pairs of AAV can 
be packaged and can integrate, but space for exogenous DNA is limited to about 4.5 kb. 
Methods for producing and using such AAVs are known in the art. See, for example, U.S. 
5 Patent Nos. 5,139,941, 5,173,414, 5,354,678, 5,436,146, 5,474,935, 5,478,745, and 
5,589,377. 

For example, an appropriate AAV vector for use in the present invention will 
include all the sequences necessary for DNA replication, encapsidation, and host-cell 
integration. The polynucleotide construct containing polynucleotides of the invention is 

10 inserted into the AAV vector using standard cloning methods, such as those found in 
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press 
(1989). The recombinant AAV vector is then transfected into packaging cells which are 
infected with a helper virus, using any standard technique, including lipofection, 
electroporation, calcium phosphate precipitation, etc. Appropriate helper viruses include 

15 adenoviruses, cytomegaloviruses, vaccinia viruses, or herpes viruses. Once the packaging 
cells are transfected and infected, they will produce infectious AAV viral particles which 
contain the polynucleotide construct of the invention. These viral particles are then used to 
transduce eukaryotic cells, either ex vivo or »i vivo. The transduced cells will contain the 
polynucleotide construct integrated into its genome, and will express the desired gene 

20 product. 

Another method of gene therapy involves operably associating heterologous control 
regions and endogenous polynucleotide sequences (e.g. encoding the polypeptide sequence 
of interest) via homologous recombination (see, e.g., U.S. Patent NO: 5,641,670, issued 
June 24, 1997; International Publication NO: WO 96/2941 1, published September 26, 
25 1996; International Publication NO: WO 94/1 2650, published August 4, 1994; Roller et 
al., Proc. Natl. Acad. Sci. USA, 86:8932-8935 (1989); and Zijlstra et al., Nature, 
342:435-438 (1989). This method involves the activation of a gene which is present in the 
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target cells, but which is not normally expressed in the cells, or is expressed at a lower 
level than desired. 

Polynucleotide constructs are made, using standard techniques known in the art, 
which contain the promoter with targeting sequences flanking the promoter. Suitable 
5 promoters are described herein. The targeting sequence is sufficiently complementary to 
an endogenous sequence to permit homologous recombination of the promoter-targeting 
sequence with the endogenous sequence. The targeting sequence will be sufficiently near 
the 5' end of the desired endogenous polynucleotide sequence so the promoter will be 
operably linked to the endogenous sequence upon homologous recombination. 
10 The promoter and the targeting sequences can be amplified using PCR. Preferably, 

the amplified promoter contains distinct restriction enzyme sites on the 5' and 3' ends. 
Preferably, the 3' end of the first targeting sequence contains the same restriction enzyme 
site as the 5' end of the amplified promoter and the 5' end of the second targeting 
sequence contains the same restriction site as the 3' end of the amplified promoter. The 
15 amplified promoter and targeting sequences are digested and ligated together. 

The promoter-targeting sequence construct is delivered to the cells, either as naked 
polynucleotide, or in conjunction with transfection-facilitating agents, such as liposomes, 
viral sequences, viral particles, whole viruses, lipofection, precipitating agents, etc., 
described in more detail above. The P promoter-targeting sequence can be delivered by 
any method, included direct needle injection, intravenous injection, topical administration, 
catheter infusion, particle accelerators, etc. The methods are described in more detail 
below. 

The promoter-targeting sequence construct is taken up by cells. Homologous 
recombination between the construct and the endogenous sequence takes place, such that 
an endogenous sequence is placed under the control of the promoter. The promoter then 
drives the expression of the endogenous sequence. 

The polynucleotides encoding polypeptides of the present invention may be 
administered along with other polynucleotides encod.ng other angiogenic proteins. 
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Angiogenic proteins include, but are not limited to, acidic and basic fibroblast growth 
factors, VEGF-1, VEGF-2 (VEGF-C), VEGF-3 (VEGF-B), epidennal growth factor alpha 
and beta, platelet-derived endothelial cell growth factor, platelet-derived growth factor, 
tumor necrosis factor alpha, hepatocyte growth factor, insulin like growth factor, colony 
stimulating factor, macrophage colony stimulating factor, granulocyte/macrophage colony 
stimulating factor, and nitric oxide synthase. 

Preferably, the polynucleotide encoding a polypeptide of the invention contains a 
secretory signal sequence that facilitates secretion of the protein. Typically, the signal 
sequence is positioned in the coding region of the polynucleotide to be expressed towards 
or at the 5' end of the coding region. The signal sequence may be homologous or 
heterologous to the polynucleotide of interest and may be homologous or heterologous to 
the cells to be transfected. Additionally, the signal sequence may be chemically 
synthesized using methods known in the art. 

Any mode of administration of any of the above-described polynucleotides 
constructs can be used so long as the mode results in the expression of one or more 
molecules in an amount sufficient to provide a therapeutic effect. This includes direct 
needle injection, systemic injection, catheter infusion, biolistic injectors, particle 
accelerators (i.e., "gene guns"), gelfoam sponge depots, other commercially available 
depot materials, osmotic pumps (e.g., Alza minipumps), oral or suppositorial solid (tablet 
or pill) pharmaceutical formulations, and decanting or topical applications during surgery. 
For example, direct injection of naked calcium phosphate-precipitated plasmid into rat 
liver and rat spleen or a protein-coated plasmid into the portal vein has resulted in gene 
expression of the foreign gene in the rat livers. (Kaneda et al., Science, 243:375 (1989)). 

A preferred method of local administration is by direct injection. Preferably, a 
recombinant molecule of the present invention complexed with a delivery vehicle is 
administered by direct injection into or locally within the area of arteries. Administration 
of a composition locally within the area of arteries refers to injecting the composition 
centimeters and preferably, millimeters within arteries. 
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Another method of local administration is to contact a polynucleotide construct of 
the present invention in or around a surgical wound. For example, a patient can undergo 
surgery and the polynucleotide construct can be coated on the surface of tissue inside the 
wound or the construct can be injected into areas of tissue inside the wound. 

Therapeutic compositions useful in systemic administration, include recombinant 
molecules of the present invention complexed to a targeted delivery vehicle of the present 
invention. Suitable delivery vehicles for use with systemic administration comprise 
liposomes comprising ligands for targeting the vehicle to a particular site. 

Preferred methods of systemic administration, include intravenous injection, 
aerosol, oral and percutaneous (topical) delivery. Intravenous injections can be performed 
using methods standard in the art. Aerosol delivery can also be performed using methods 
standard in the art (see, for example, Stribling et al., Proc. Natl. Acad. Sci. USA , 
189:11277-11281 (1992), which is incorporated herein by reference). Oral delivery can be 
performed by complexing a polynucleotide construct of the present invention to a carrier 
capable of withstanding degradation by digestive enzymes in the gut of an animal. 
Examples of such carriers, include plastic capsules or tablets, such as those known in the 
art. Topical delivery can be performed by mixing a polynucleotide construct of the present 
invention with a lipophilic reagent (e.g., DMSO) that is capable of passing into the skin. 

Determining an effective amount of substance to be delivered can depend upon a 
number of factors including, for example, the chemical structure and biological activity of 
the substance, the age and weight of the animal, the precise condition requiring treatment 
and its severity, and the route of administration. The frequency of treatments depends 
upon a number of factors, such as the amount of polynucleotide constructs administered 
per dose, as well as the health and history of the subject. The precise amount, number of 
doses, and timing of doses will be determined by the attending physician or veterinarian. 
Therapeutic compositions of the present invention can be administered to any animal, 
preferably to mammals and birds. Preferred mammals include humans, dogs, cats, mice, 
rats, rabbits sheep, cattle, horses and pigs, with humans being particularly 
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Biological Activities 

The polynucleotides or polypeptides, or agonists or antagonists of the present 
invention can be used in assays to test for one or more biological activities. If these 
polynucleotides and polypeptides do exhibit activity in a particular assay, it is likely that 
these molecules may be involved in the diseases associated with the biological activity. 
Thus, the polynucleotides or polypeptides, or agonists or antagonists could be used to treat 
the associated disease. 



10 Immune Activity 

The polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may be useful in treating deficiencies or disorders of the immune system, by 
activating or inhibiting the proliferation, differentiation, or mobilization (chemotaxis) of 
immune cells. Immune cells develop through a process called hematopoiesis, producing 

15 myeloid (platelets, red blood cells, neutrophils, and macrophages) and lymphoid (B and T 
lymphocytes) cells from pluripotent stem cells. The etiology of these immune deficiencies 
or disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, a polynucleotides or 
polypeptides, or agonists or antagonists of the present invention can be used as a marker or 

20 detector of a particular immune system disease or disorder. 

A polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may be useful in treating or detecting deficiencies or disorders of hematopoietic 
cells. A polynucleotides or polypeptides, or agonists or antagonists of the present 
invention could be used to increase differentiation and proliferation of hematopoietic cells, 

25 including the pluripotent stem cells, in an effort to treat those disorders associated with a 
decrease in certain (or many) types hematopoietic cells. Examples of immunologic 
deficiency syndromes include, but are not limited to: blood protein disorders (e.g. 
agammaglobulinemia, dysgammaglobulinemia), ataxia telangiectasia, common variable 
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immunodeficiency, Digeorge Syndrome, fflV infection, HTLV-BLV infection, leukocyte 
adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 

Moreover, a polynucleotides or polypeptides, or agonists or antagonists of the 
present invention could also be used to modulate hemostatic (the stopping of bleeding) or 
thrombolytic activity (clot formation). For example, by increasing hemostatic or 
thrombolytic activity, a polynucleotides or polypeptides, or agonists or antagonists of the 
present invention could be used to treat blood coagulation disorders (e.g., afibrinogenemia, 
factor deficiencies), blood platelet disorders (e.g. thrombocytopenia), or wounds resulting 
from trauma, surgery, or other causes. Alternatively, a polynucleotides or polypeptides, lr 
agonists or antagonists of the present invention that can decrease hemostatic or 
thrombolytic activity could be used to inhibit or dissolve clotting. These molecules could 
be important in the treatment of heart attacks (infarction), strokes, or scarring. 

A polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may also be useful in treating or detecting autoimmune disorders. Many 
autoimmune disorders result from inappropriate recognition of self as foreign material by 
•mmune cells. This inappropriate recognition results in an immune response leading to the 
destruction of the host tissue. Therefore, the administration of a polynucleotides or 
polypeptides, or agonists or antagonists of the present invention that inhibits an immune 
response, particularly the proliferation, differentiation, or chemotaxis of T^ells, may be an 
effective therapy in preventing autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
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Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and autoimmune 
inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polynucleotides or 
polypeptides, or agonists or antagonists of the present invention. Moreover, these 
molecules can be used to treat anaphylaxis, hypersensitivity to an antigenic molecule, or 
blood group incompatibility. 

A polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may also be used to treat and/or prevent organ rejection or graft-versus-host 
disease (GVHD). Organ rejection occurs by host immune cell destruction of the 
transplanted tissue through an immune response. Similarly, an immune response is also 
involved in GVHD, but, in this case, the foreign transplanted immune cells destroy the host 
tissues. The administration of a polynucleotides or polypeptides, or agonists or antagonists 
of the present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing organ 
rejection or GVHD. 

Similarly, a polynucleotides or polypeptides, or agonists or antagonists of the 
present invention may also be used to modulate inflammation. For example, the 
polypeptide or polynucleotide or agonists or antagonist may inhibit the proliferation and 
differentiation of cells involved in an inflammatory response. These molecules can be 
used to treat inflammatory conditions, both chronic and acute conditions, including 
inflammation associated with infection (e.g., septic shock, sepsis, or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine induced lung injury, inflammatory bowel disease, Crohn's disease, or resulting 
from over production of cytokines (e.g., TNF or IL-1.) 
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Hvru rprolifprativc DisnrHpre 

A polynucleotides or polypeptides, or agonists or antagonists of the invention can 
be used to treat or detect hyperproliferative disorders, including neoplasms. A 
polynucleotides or polypeptides, or agonists or antagonists of the present invention may 
5 inhibit the proliferation of the disorder through direct or indirect interactions. 

Alternatively, a polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing antigenic 
qualities of the hyperproliferative disorder or by proliferating, differentiating, or 
10 mobilizing T-cells, hyperproliferative disorders can be treated. This immune response may 
be increased by either enhancing an existing immune response, or by initiating a new 
immune response. Alternatively, decreasing an immune response may also be a method of 
treating hyperproliferative disorders, such as a chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
1 5 polynucleotides or polypeptides, or agonists or antagonists of the present invention 

include, but are not limited to neoplasms located in the: abdomen, bone, breast, digestive 
system, liver, pancreas, peritoneum, endocrine glands (adrenal, parathyroid, pituitary, 
testicles, ovary, thymus, thyroid), eye, head and neck, nervous (central and peripheral), 
lymphatic system, pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 
20 Similarly, other hyperproliferative disorders can also be treated or detected by a 

polynucleotides or polypeptides, or agonists or antagonists of the present invention. 
Examples of such hyperproliferative disorders include, but are not limited to: 
hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
sarcoidosis, Sezary Syndrome, Waldenstrom* Macroglobulinemia, Gaucher's Disease, 
25 histiocytosis, and any other hyperproliferative disease, besides neoplasia, located in an 
organ system listed above. 
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One preferred embodiment utilizes polynucleotides of the present invention to 
inhibit aberrant cellular division, by gene therapy using the present invention, and/or 
protein fusions or fragments thereof. 

Thus, the present invention provides a method for treating cell proliferative 
disorders by inserting into an abnormally proliferating cell a polynucleotide of the present 
invention, wherein said polynucleotide represses said expression. 

Another embodiment of the present invention provides a method of treating cell- 
proliferative disorders in individuals comprising administration of one or more active gene 
copies of the present invention to an abnormally proliferating cell or cells. In a preferred 
embodiment, polynucleotides of the present invention is a DNA construct comprising a 
recombinant expression vector effective in expressing a DNA sequence encoding said 
polynucleotides. In another preferred embodiment of the present invention, the DNA 
construct encoding the poynucleotides of the present invention is inserted into cells to be 
treated utilizing a retrovirus, or more preferrably an adenoviral vector (See G J. Nabel, et. 
al., PNAS 1999 96: 324-326, which is hereby incorporated by reference). In a most 
preferred embodiment, the viral vector is defective and will not transform non-proliferating 
cells, only proliferating cells. Moreover, in a preferred embodiment, the polynucleotides 
of the present invention inserted into proliferating cells either alone, or in combination 
with or fused to other polynucleotides, can then be modulated via an external stimulus (i.e. 
magnetic, specific small molecule, chemical, or drug administration, etc.), which acts upon 
the promoter upstream of said polynucleotides to induce expression of the encoded protein 
product. As such the beneficial therapeutic affect of the present invention may be 
expressly modulated (i.e. to increase, decrease, or inhibit expression of the present 
invention) based upon said external stimulus. 

Polynucleotides of the present invention may be useful in repressing expression of 
oncogenic genes or antigens. By "repressing expression of the oncogenic genes " is 
intended the suppression of the transcription of the gene, the degradation of the gene 
transcript (pre-message RNA), the inhibition of splicing, the destruction of the messenger 
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RNA, the prevention of the post-translational modifications of the protein, the destruction 
of the protein, or the inhibition of the normal function of the protein. 

For local administration to abnormally proliferating cells, polynucleotides of the 
present invention may be administered by any method known to those of skill in the art 
including, but not limited to transfection, electroporation, microinjection of cells, or in 
vehicles such as liposomes, lipofectin, or as naked polynucleotides, or any other method 
described throughout the specification. The polynucleotide of the present invention may 
be delivered by known gene delivery systems such as, but not limited to, retroviral vectors 
(Gilboa, J. Virology 44:845 (1982); Hocke, Nature 320:275 (1986); Wilson, et al., Proc. 
Natl. Acad. Sci. U.S.A. 85:3014), vaccinia virus system (Chakrabarty et al., Mol. Cell 
Biol. 5:3403 (1985) or other efficient DNA delivery systems (Yates et al.. Nature 313:812 
(1985)) known to those skilled in the art. These references are exemplary only and are 
hereby incorporated by reference. In order to specifically deliver or transfect cells which 
are abnormally proliferating and spare non-dividing cells, it is preferable to utilize a 
retrovirus, or adenoviral (as described in the art and elsewhere herein) delivery system 
known to those of skill in the art. Since host DNA replication is required for retroviral 
DNA to integrate and the retrovirus will be unable to self replicate due to the lack of the 
retrovirus genes needed for its life cycle. Utilizing such a retroviral delivery system for 
polynucleotides of the present invention will target said gene and constructs to abnormally 
.proliferating cells and will spare the non-dividing normal cells. 

The polynucleotides of the present invention may be delivered directly to cell 
proliferative disorder/disease sites in internal organs, body cavities and the like by use of 
imaging devices used to guide an injecting needle directly to the disease site. The 
polynucleotides of the present invention may also be administered to disease sites at the 
time of surgical intervention. 

By "cell proliferative disease" is meant any human or animal disease or disorder, 
affecting any one or any combination of organs, cavities, or body parts, which is 
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characterized by single or multiple local abnormal proliferations of cells, groups of cells, 
or tissues, whether benign or malignant. 

Any amount of the polynucleotides of the present invention may be administered as 
long as it has a biologically inhibiting effect on the proliferation of the treated cells. 
5 Moreover, it is possible to administer more than one of the polynucleotide of the present 
invention simultaneously to the same site. By "biologically inhibiting" is meant partial or 
total growth inhibition as well as decreases in the rate of proliferation or growth of the 
cells. The biologically inhibitory dose may be determined by assessing the effects of the 
polynucleotides of the present invention on target malignant or abnormally proliferating 
10 cell growth in tissue culture, tumor growth in animals and cell cultures, or any other 
method known to one of ordinary skill in the art. 

The present invention is further directed to antibody-based therapies which involve 
administering of anti-polypeptides and anti-polynucleotide antibodies to a mammalian, 
preferably human, patient for treating one or more of the described disorders. Methods for 
15 producing anti-polypeptides and anti-polynucleotide antibodies polyclonal and monoclonal 
antibodies are described in detail elsewhere herein. Such antibodies may be provided in 
pharmaceutical^ acceptable compositions as known in the art or as described herein. 

A summary of the ways in which the antibodies of the present invention may be 
used therapeutically includes binding polynucleotides or polypeptides of the present 
20 invention locally or systemically in the body or by direct cytotoxicity of the antibody, e.g. 
as mediated by complement (CDC) or by effector cells (ADCC). Some of these 
approaches are described in more detail below. Armed with the teachings provided herein, 
one of ordinary skill in the art will know how to use the antibodies of the present invention 
for diagnostic, monitoring or therapeutic purposes without undue experimentation. 
25 In particular, the antibodies, fragments and derivatives of the present invention are 

useful for treating a subject having or developing cell proliferative and/or differentiation 
disorders as described herein. Such treatment comprises administering a single or multiple 
doses of the antibody, or a fragment, derivative, or a conjugate thereof. 
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The antibodies of this invention may be advantageously utilized in combination 
with other monoclonal or chimeric antibodies, or with Iymphokines or hematopoietic 
growth factors, for example, which serve to increase the number or activity of effector 
cells which interact with the antibodies. 

It is preferred to use high affinity and/or potent in vivo inhibiting and/or 
neutralizing antibodies against polypeptides or polynucleotides of the present invention, 
fragments or regions thereof, for both immunoassays directed to and therapy of disorders 
related to polynucleotides or polypeptides, including fragements thereof, of the present 
invention. Such antibodies, fragments, or regions, will preferably have an affinity for 
polynucleotides or polypeptides, including fragements thereof. Preferred binding affinities 
include those with a dissociation constant or Kd less than 5X10-*M, 10 6 M, 5X10 7 M, 10" 
7 M, 5X10»M, 10 8 M, 5X10>M, 10 9 M, 5X10 ,U M, 10"°M, 5X10"M, 10"M, 5X10 "M, 10 
,: M, SXIO-'H 10- ,J M, 5X10-"M, 10 I4 M, 5X10 15 M, and I0' 5 M. 

Moreover, polypeptides of the present invention are useful in inhibiting the 
angiogenesis of proliferative cells or tissues, either alone, as a protein fusion, or in 
combination with other polypeptides directly or indirectly, as described elsewhere herein. 
In a most preferred embodiment, said anti-angiogenesis effect may be achieved indirectly, 
for example, through the inhibition of hematopoietic, tumor-specific cells, such as tumor- 
associated macrophages (See Joseph IB, et al. J Natl Cancer Inst, 90(21): 1648-53 (1998), 
which is hereby incorporated by reference). Antibodies directed to polypeptides or 
polynucleotides of the present invention may also result in inhibition of angiogenesis 
directly, or indirectly (See Witte L, et al.. Cancer Metastasis Rev. 17(2):155-61 (1998), 
which is hereby incorporated by reference)). 

Polypeptides, including protein fusions, of the present invention, or fragments 
thereof may be useful in inhibiting proliferative cells or tissues through the induction of 
apoptosis. Said polypeptides may act either directly, or indirectly to induce apoptosis of 
proliferative cells and tissues, for example in the activation of a death-domain receptor, 
such as tumor necrosis factor (TNF) receptor-1, CD95 (Fas/APO-1), TNF-receptor-related 
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apoptosis-mediated protein (TRAMP) and TNF-related apoptosis-inducing ligand (TRAIL) 
receptor-1 and -2 (See Schulze-Osthoff K, et.al., Eur J Biochem 254(3):439-59 (1998), 
which is hereby incorporated by reference). Moreover, in another preferred embodiment 
of the present invention, said polypeptides may induce apoptosis through other 
mechanisms, such as in the activation of other proteins which will activate apoptosis, or 
through stimulating the expression of said proteins, either alone or in combination with 
small molecule drugs or adjuviants, such as apoptonin, galectins, thioredoxins, 
antiinflammatory proteins (See for example, Mutat Res 400(1 -2):447-55 (1998), Med 
Hypotheses.50(5):423-33 (1998), Chem Biol Interact. Apr 24:111-112:23-34 (1998), J Mol 
Med.76(6):402-12 (1998), Int J Tissue React;20(l):3-15 (1998), which are all hereby 
incorporated by reference). 

Polypeptides, including protein fusions to, or fragments thereof, of the present 
invention are useful in inhibiting the metastasis of proliferative cells or tissues. Inhibition 
may occur as a direct result of administering polypeptides, or antibodies directed to said 
polypeptides as described elsewere herein, or indirectly, such as activating the expression 
of proteins known to inhibit metastasis, for example alpha 4 integrins, (See, e.g., CurrTop 
Microbiol Immunol 1998;231:125-41, which is hereby incorporated by reference). Such 
thereapeutic affects of the present invention may be achieved either alone, or in 
combination with small molecule drugs or adjuvants. 

In another embodiment, the invention provides a method of delivering 
compositions containing the polypeptides of the invention (e.g., compositions containing 
polypeptides or polypeptide antibodes associated with heterologous polypeptides, 
heterologous nucleic acids, toxins, or prodrugs) to targeted cells expressing the polypeptide 
of the present invention. Polypeptides or polypeptide antibodes of the invention may be 
associated with with heterologous polypeptides, heterologous nucleic acids, toxins, or 
prodrugs via hydrophobic, hydrophilic, ionic and/or covalent interactions. 
Polypeptides, protein fusions to, or fragments thereof, of the present invention are useful in 
enhancing the immunogenicity and/or antigenicity of proliferating cells or tissues, either 
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directly, such as would occur if the polypeptides of the present invention 'vaccinated' the 
immune response to respond to proliferative antigens and immunogens, or indirectly, such 
as in activating the expression of proteins known to enhance the immune response (e.g. 
chemokines), to said antigens and immunogens. 

Cardiovascular Disorde r^ 

Polynucleotides or polypeptides, or agonists or antagonists of the invention may be 
...used to treat cardiovascular disorders, including peripheral artery disease, such as limb 
ischemia. 

Cardiovascular disorders include cardiovascular abnormalities, such as arterio- 
arterial fistula, arteriovenous fistula, cerebral arteriovenous malformations, congenital 
heart defects, pulmonary atresia, and Scimitar Syndrome. Congenital heart defects 
include aortic coarctation, cor triatriatum, coronary vessel anomalies, crisscross heart, 
dextrocardia, patent ductus arteriosus, Ebstein's anomaly, Eisenmenger complex, 
hypoplastic left heart syndrome, levocardia, tetralogy of fallot, transposition of great 
vessels, double outlet right ventricle, tricuspid atresia, persistent truncus arteriosus, and 
heart septal defects, such as aortopulmonary septal defect, endocardial cushion defects, 
Lutembacher's Syndrome, trilogy of Fallot, ventricular heart septal defects. 

Cardiovascular disorders also include heart disease, such as arrhythmias, carcinoid 
heart disease, high cardiac output, low cardiac output, cardiac tamponade, endocarditis 
(including bacterial), heart aneurysm, cardiac arrest, congestive heart failure, congestive 
cardiomyopathy, paroxysmal dyspnea, cardiac edema, heart hypertrophy, congestive 
cardiomyopathy, left ventricular hypertrophy, right ventricular hypertrophy, post-infarction 
heart rupture, ventricular septal rupture, heart valve diseases, myocardial diseases, 
myocardial ischemia, pericardial effusion, pericarditis (including constrictive and 
tuberculous), pneumopericardium, postpericardiotomy syndrome, pulmonary heart disease, 
rheumatic heart disease, ventricular dysfunction, hyperemia, cardiovascular pregnancy 
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complications, Scimitar Syndrome, cardiovascular syphilis, and cardiovascular 
tuberculosis. 

Arrhythmias include sinus arrhythmia, atrial fibrillation, atrial flutter, bradycardia, 
extrasystole, Adams-Stokes Syndrome, bundle-branch block, sinoatrial block, long QT 
5 syndrome, parasystole, Lown-Ganong-Levine Syndrome, Mahaim-type pre-excitation 
syndrome, Wolff-Parkinson-White syndrome, sick sinus syndrome, tachycardias, and 
ventricular fibrillation. Tachycardias include paroxysmal tachycardia, supraventricular 
tachycardia, accelerated idioventricular rhythm, atrioventricular nodal reentry tachycardia, 
ectopic atrial tachycardia, ectopic junctional tachycardia, sinoatrial nodal reentry 
10 tachycardia, sinus tachycardia, Torsades de Pointes, and ventricular tachycardia. 

Heart valve disease include aortic valve insufficiency, aortic valve stenosis, hear 
murmurs, aortic valve prolapse, mitral valve prolapse, tricuspid valve prolapse, mitral 
valve insufficiency, mitral valve stenosis, pulmonary atresia, pulmonary valve 
insufficiency, pulmonary valve stenosis, tricuspid atresia, tricuspid valve insufficiency, and 
15 tricuspid valve stenosis. 

Myocardial diseases include alcoholic cardiomyopathy, congestive 
cardiomyopathy, hypertrophic cardiomyopathy, aortic subvalvular stenosis, pulmonary 
subvalvular stenosis, restrictive cardiomyopathy, Chagas cardiomyopathy, endocardial 
fibroelastosis, endomyocardial fibrosis, Keams Syndrome, myocardial reperfusion injury, 
20 and myocarditis. 

Myocardial ischemias include coronary disease, such as angina pectoris, coronary 
aneurysm, coronary arteriosclerosis, coronary thrombosis, coronary vasospasm, 
myocardial infarction and myocardial stunning. 

Cardiovascular diseases also include vascular diseases such as aneurysms, 
25 angiodysplasia, angiomatosis, bacillary angiomatosis, Hippel-Lindau Disease, Klippel- 
Trenaunay-Weber Syndrome, Sturge-Weber Syndrome, angioneurotic edema, aortic 
diseases, Takayasu's Arteritis, aortitis, Leriche's Syndrome, arterial occlusive diseases, 
arteritis, enarteritis, polyarteritis nodosa, cerebrovascular disorders, diabetic angiopathies, 
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diabetic retinopathy, embolisms, thrombosis, erythromelalgia, hemorrhoids, hepatic veno- 
occlusive disease, hypertension/hypotension, ischemia, peripheral vascular diseases, 
phlebitis, pulmonary veno-occlusive disease, Raynaud's disease, CREST syndrome, retina! 
vein occlusion, Scimitar syndrome, superior vena cava syndrome, telangiectasia, atacia 
5 telangiectasia, hereditary hemorrhagic telangiectasia, varicocele, varicose veins, varicose 
ulcer, vasculitis, and venous insufficiency. 

Aneurysms include dissecting aneurysms, false aneurysms, infected aneurysms, 
ruptured aneurysms, aortic aneurysms, cerebral aneurysms, coronary aneurysms, heart 
aneurysms, and iliac aneurysms. 

10 Arterial occlusive diseases include arteriosclerosis, intermittent claudication, 

carotid stenosis, fibromuscular dysplasias, mesenteric vascular occlusion, Moyamoya 
disease, renal artery obstruction, retinal artery occlusion, and thromboangiitis obliterans. 

Cerebrovascular disorders include carotid artery diseases, cerebral amyloid 
angiopathy, cerebral aneurysm, cerebral anoxia, cerebral arteriosclerosis, cerebral 

15 arteriovenous malformation, cerebral artery diseases, cerebral embolism and thrombosis, 
carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, cerebral hemorrhage, 
epidural hematoma, subdural hematoma, subaraxhnoid hemorrhage, cerebral infarction, 
cerebral ischemia (including transient), subclavian steal syndrome, periventricular 
leukomalacia, vascular headache, cluster headache, migraine, and vertebrobasilar 
20 insufficiency. 

Embolisms include air embolisms, amniotic fluid embolisms, cholesterol 
embolisms, blue toe syndrome, fat embolisms, pulmonary embolisms, and 
thromoboembolisms. Thrombosis include coronary thrombosis, hepatic vein thrombosis, 
retinal vein occlusion, carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, 
25 and thrombophlebitis. 

Ischemia includes cerebral ischemia, ischemic colitis, compartment syndromes, 
anterior compartment syndrome, myocardial ischemia, reperfusion injuries, and peripheral 
limb ischemia. Vasculitis includes aortitis, arteritis, Behcet's Syndrome, Churg-Strauss 
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Syndrome, mucocutaneous lymph node syndrome, thromboangiitis obliterans, 
hypersensitivity vasculitis, Schoenlein-Henoch purpura, allergic cutaneous vasculitis, and 
Wegener's granulomatosis. 

Polynucleotides or polypeptides, or agonists or antagonists of the invention, are 
5 especially effective for the treatment of critical limb ischemia and coronary disease. 

Polypeptides may be administered using any method known in the art, including, 
but not limited to, direct needle injection at the delivery site, intravenous injection, topical 
administration, catheter infusion, biolistic injectors, particle accelerators, gelfoam sponge 
depots, other commercially available depot materials, osmotic pumps, oral or suppositorial 
10 solid pharmaceutical formulations, decanting or topical applications during surgery, 
aerosol delivery. Such methods are known in the art. Polypeptides of the invention may be 
administered as part of a Therapeutic, described in more detail below. Methods of 
delivering polynucleotides of the invention are described in more detail herein. 

15 Anti-Angio genesis Activity 

The naturally occurring balance between endogenous stimulators and inhibitors of 
angiogenesis is one in which inhibitory influences predominate. Rastinejad et ai. Cell 
56:345-355 (1989). In those rare instances in which neovascularization occurs under 
normal physiological conditions, such as wound healing, organ regeneration, embryonic 

20 development, and female reproductive processes, angiogenesis is stringently regulated and 
spatially and temporally delimited. Under conditions of pathological angiogenesis such as 
that characterizing solid tumor growth, these regulatory controls fail. Unregulated 
angiogenesis becomes pathologic and sustains progression of many neoplastic and non- 
neoplastic diseases. A number of serious diseases are dominated by abnormal 

25 neovascularization including solid tumor growth and metastases, arthritis, some types of 
eye disorders, and psoriasis. See, e.g., reviews by Moses et ai, Biotech. 9:630-634 (1991); 
Folkman et ai, N. Engl. J. Med., 5J5:1757-1763 (1995); Auerbach et ai, J. Microvasc. 
Res. 29:401-411 (1985); Folkman, Advances in Cancer Research, eds. Klein and 
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Weinhouse, Academic Press, New York, pp. 175-203 (1985); Patz, Am. J. Opthalmol. 
9*715-743 (1982); and Folkman et al, Science 227:719-725 (1983). In a number of 
pathological conditions, the process of angiogenesis contributes to the disease state. For 
example, significant data have accumulated which suggest that the growth of solid tumors 
is dependent on angiogenesis. Folkman and Klagsbrun, Science 235:442-441 (1987). 

The present invention provides for treatment of diseases or disorders associated 
with neovascularization by administration of the polynucleotides and/or polypeptides of 
the invention, as well as agonists or antagonists of the present invention. Malignant and 
metastatic conditions which can be treated with the polynucleotides and polypeptides, or 
agonists or antagonists of the invention include, but are not limited to, malignancies, solid 
tumors, and cancers described herein and otherwise known in the art (for a review of such 
disorders, see Fishman et al., Medicine, 2d Ed., J. B. Lippincott Co., Philadelphia 
(1985)).Thus, the present invention provides a method of treating an angiogenesis-related 
disease and/or disorder, comprising administering to an individual in need thereof a 
therapeutically effective amount of a polynucleotide, polypeptide, antagonist and/or 
agonist of the invention. For example, polynucleotides, polypeptides, antagonists and/or 
agonists may be utilized in a variety of additional methods in order to therapeutically treat 
a cancer or tumor. Cancers which may be treated with polynucleotides, polypeptides, 
antagonists and/or agonists include, but are not limited to solid tumors, including prostate, 
: lung, breast, ovarian, stomach, pancreas, larynx, esophagus, testes, liver, parotid, biliary 
tract, colon, rectum, cervix, uterus, endometrium, kidney, bladder, thyroid cancer; primary 
tumors and metastases; melanomas; glioblastoma; Kaposi's sarcoma; leiomyosarcoma; 
non- small cell lung cancer; colorectal cancer; advanced malignancies; and blood bom 
tumors such as leukemias. For example, polynucleotides, polypeptides, antagonists and/or 
agonists may be delivered topically, in order to treat cancers such as skin cancer, head and 
neck tumors, breast tumors, and Kaposi's sarcoma. 

Within yet other aspects, polynucleotides, polypeptides, antagonists and/or agonists 
may be utilized to treat superficial forms of bladder cancer by, for example, intravesical 
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administration. Polynucleotides, polypeptides, antagonists and/or agonists may be 
delivered directly into the tumor, or near the tumor site, via injection or a catheter. Of 
course, as the artisan of ordinary skill will appreciate, the appropriate mode of 
administration will vary according to the cancer to be treated. Other modes of delivery are 
5 discussed herein. 

Polynucleotides, polypeptides, antagonists and/or agonists may be useful in treating 
other disorders, besides cancers, which involve angiogenesis. These disorders include, but 
are not limited to: benign tumors, for example hemangiomas, acoustic neuromas, 
neurofibromas, trachomas, and pyogenic granulomas; artheroscleric plaques; ocular 

10 angiogenic diseases, for example, diabetic retinopathy, retinopathy of prematurity, macular 
degeneration, corneal graft rejection, neovascular glaucoma, retrolental fibroplasia, 
rubeosis, retinoblastoma, uvietis and Pterygia (abnormal blood vessel growth) of the eye; 
rheumatoid arthritis; psoriasis; delayed wound healing; endometriosis; vasculogenesis; 
granulations; hypertrophic scars (keloids); nonunion fractures; scleroderma; trachoma; 

15 vascular adhesions; myocardial angiogenesis; coronary collaterals; cerebral collaterals; 
arteriovenous malformations; ischemic limb angiogenesis; Osier-Webber Syndrome; 
plaque neovascularization; telangiectasia; hemophiliac joints; angiofibroma; fibromuscular 
dysplasia; wound granulation; Crohn's disease; and atherosclerosis. 

For example, within one aspect of the present invention methods are provided for 

20 treating hypertrophic scars and keloids, comprising the step of administering a 
polynucleotide, polypeptide, antagonist and/or agonist of the invention to a hypertrophic 
scar or keloid. 

Within one embodiment of the present invention polynucleotides, polypeptides, 
antagonists and/or agonists are directly injected into a hypertrophic scar or keloid, in order 
25 to prevent the progression of these lesions. This therapy is of particular value in the 
prophylactic treatment of conditions which are known to result in the development of 
hypertrophic scars and keloids (e.g., burns), and is preferably initiated after the 
proliferative phase has had time to progress (approximately 14 days after the initial injury), 
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but before hypertrophic scar or keloid development. As noted above, the present invention 
also provides methods for treating neovascular diseases of the eye, including for example, 
corneal neovascularization, neovascular glaucoma, proliferative diabetic retinopathy, 
retrolental fibroplasia and macular degeneration. 

Moreover, Ocular disorders associated with neovascularization which can be 
treated with the polynucleotides and polypeptides of the present invention (including 
agonists and/or antagonists) include, but are not limited to: neovascular glaucoma, diabetic 
retinopathy, retinoblastoma, retrolental fibroplasia, uveitis, retinopathy of prematurity 
macular degeneration, corneal graft neovascularization, as well as other eye inflammatory 
diseases, ocular tumors and diseases associated with choroidal or iris neovascularization. 
See, e.g., reviews by Waltman et ai, Am. J. Ophthal. 55:704-710 (1978) and Gartner et 
al, Surv. Ophthal. 22:291-312 (1978). 

Thus, within one aspect of the present invention methods are provided for treating 
neovascular diseases of the eye such as corneal neovascularization (including corneal graft 
neovascularization), comprising the step of administering to a patient a therapeutically 
effective amount of a compound (as described above) to the cornea, such that the 
formation of blood vessels is inhibited. Briefly, the cornea is a tissue which normally lacks 
blood vessels. In certain pathological conditions however, capillaries may extend into the 
cornea from the pericorneal vascular plexus of the limbus. When the cornea becomes 
vascularized, it also becomes clouded, resulting in a decline in the patienfs visual acuity. 
Visual loss may become complete if the cornea completely opacitates. A wide variety of 
disorders can result in corneal neovascularization, including for example, corneal 
infections (e.g., trachoma, herpes simplex keratitis, leishmaniasis and onchocerciasis), 
immunological processes (e.g., graft rejection and Stevens-Johnson's syndrome), alkali 
bums, trauma, inflammation (of any cause), toxic and nutritional deficiency states, and as a 
complication of wearing contact lenses. 

Within particularly preferred embodiments of the invention, may be prepared for 
topical administration in saline (combined with any of the preservatives and antimicrobial 
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agents commonly used in ocular preparations), and administered in eyedrop form. The 
solution or suspension may be prepared in its pure form and administered several times 
daily. Alternatively, anti -angiogenic compositions, prepared as described above, may also 
be administered directly to the cornea. Within preferred embodiments, the anti-angiogenic 
5 composition is prepared with a muco-adhesive polymer which binds to comea. Within 
further embodiments, the anti-angiogenic factors or anti-angiogenic compositions may be 
utilized as an adjunct to conventional steroid therapy. Topical therapy may also be useful 
prophylactically in corneal lesions which are known to have a high probability of inducing 
an angiogenic response (such as chemical burns). In these instances the treatment, likely 
10 in combination with steroids, may be instituted immediately to help prevent subsequent 
complications. 

Within other embodiments, the compounds described above may be injected 
directly into the corneal stroma by an ophthalmologist under microscopic guidance. The 
preferred site of injection may vary with the morphology of the individual lesion, but the 

15 goal of the administration would be to place the composition at the advancing front of the 
vasculature (i.e., interspersed between the blood vessels and the normal cornea). In most 
cases this would involve perilimbic corneal injection to "protect" the comea from the 
advancing blood vessels. This method may also be utilized shortly after a corneal insult in 
order to prophylactically prevent corneal neovascularization. In this situation the material 

20 could be injected in the perilimbic cornea interspersed between the corneal lesion and its 
undesired potential limbic blood supply. Such methods may also be utilized in a similar 
fashion to prevent capillary invasion of transplanted corneas. In a sustained-release form 
injections might only be required 2-3 times per year. A steroid could also be added to the 
injection solution to reduce inflammation resulting from the injection itself. 

25 Within another aspect of the present invention, methods are provided for treating 

neovascular glaucoma, comprising the step of administering to a patient a therapeutically 
effective amount of a polynucleotide, polypeptide, antagonist and/or agonist to the eye, 
such that the formation of blood vessels is inhibited. In one embodiment, the compound 
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may be administered topically to the eye in order to treat early forms of neovascular 
glaucoma. Within other embodiments, the compound may be implanted by injection into 
the region of the anterior chamber angle. Within other embodiments, the compound may 
also be placed in any location such that the compound is continuously released into the 
5 aqueous humor. Within another aspect of the present invention, methods are provided for 
treating proliferative diabetic retinopathy, comprising the step of administering to a patient 
a therapeutically effective amount of a polynucleotide, polypeptide, antagonist and/or 
agonist to the eyes, such that the formation of blood vessels is inhibited. 

Within particularly preferred embodiments of the invention, proliferative diabetic 
10 retinopathy may be treated by injection into the aqueous humor or the vitreous, in order to 
increase the local concentration of the polynucleotide, polypeptide, antagonist and/or 
agonist in the retina. Preferably, this treatment should be initiated prior to the acquisition 
of severe disease requiring photocoagulation. 

Within another aspect of the present invention, methods are provided for treating 
15 retrolental fibroplasia, comprising the step of administering to a patient a therapeutically 
effective amount of a polynucleotide, polypeptide, antagonist and/or agonist to the eye, 
such that the formation of blood vessels is inhibited. The compound may be administered 
topically, via intravitreous injection and/or via intraocular implants. 

Additionally, disorders which can be treated with the polynucleotides, 
20 polypeptides, agonists and/or agonists include, but are not limited to, hemangioma, 
arthritis, psoriasis, angiofibroma, atherosclerotic plaques, delayed wound healing, 
granulations, hemophilic joints, hypertrophic scars, nonunion fractures, Osier-Weber 
syndrome, pyogenic granuloma, scleroderma, trachoma, and vascular adhesions. 

Moreover, disorders and/or states, which can be treated with be treated with the the 
25 polynucleotides, polypeptides, agonists and/or agonists include, but are not limited to, solid 
tumors, blood born tumors such as leukemias, tumor metastasis, Kaposi's sarcoma, benign 
tumors, for example hemangiomas, acoustic neuromas, neurofibromas, trachomas, and 
pyogenic granulomas, rheumatoid arthritis, psoriasis, ocular angiogenic diseases, for 
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example, diabetic retinopathy, retinopathy of prematurity, macular degeneration, corneal 
graft rejection, neovascular glaucoma, retrolental fibroplasia, rubeosis, retinoblastoma, and 
uvietis, delayed wound healing, endometriosis, vascluogenesis, granulations, hypertrophic 
scars (keloids), nonunion fractures, scleroderma, trachoma, vascular adhesions, myocardial 
angiogenesis, coronary collaterals, cerebral collaterals, arteriovenous malformations, 
ischemic limb angiogenesis, Osier-Webber Syndrome, plaque neovascularization, 
telangiectasia, hemophiliac joints, angiofibroma fibromuscular dysplasia, wound 
granulation, Crohn* disease, atherosclerosis, birth control agent by preventing 
vascularization required for embryo implantation controlling menstruation, diseases that 
have angiogenesis as a pathologic consequence such as cat scratch disease (Rochele 
minalia quintosa), ulcers (Helicobacter pylori), Bartonellosis and bacillary angiomatosis. 

In one aspect of the birth control method, an amount of the compound sufficient to 
block embryo implantation is administered before or after intercourse and fertilization have 
occurred, thus providing an effective method of birth control, possibly a '•morning after- 
method. Polynucleotides, polypeptides, agonists and/or agonists may also be used in 
controlling menstruation or administered as either a peritoneal lavage fluid or for 
peritonea] implantation in the treatment of endometriosis. 

Polynucleotides, polypeptides, agonists and/or agonists of the present invention 
may be incorporated into surgical sutures in order to prevent stitch granulomas. 

Polynucleotides, polypeptides, agonists and/or agonists may be utilized in a wide 
variety of surgical procedures. For example, within one aspect of the present invention a 
compositions (in the form of, for example, a spray or film) may be utilized to coat or spray 
an area prior to removal of a tumor, in order to isolate normal surrounding tissues from 
malignant tissue, and/or to prevent the spread of disease to surrounding tissues. Within 
other aspects of the present invention, compositions (e.g., in the form of a spray) may be 
delivered via endoscopic procedures in order to coat tumors, or inhibit angiogenesis in a 
desired locale. Within yet other aspects of the present invention, surgical meshes which 
have been coated with anti- angiogenic compositions of the present invention may be 
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utilized in any procedure wherein a surgical mesh might be utilized. For example, within 
one embodiment of the invention a surgical mesh laden with an anti-angiogenic 
composition may be utilized during abdominal cancer resection surgery (e.g., subsequent 
to colon resection) in order to provide support to the structure, and to release an amount of 
the anti-angiogenic factor. 

Within further aspects of the present invention, methods are provided for treating 
tumor excision sites, comprising administering a polynucleotide, polypeptide, agonist 
and/or agonist to the resection margins of a tumor subsequent to excision, such that the 
local recurrence of cancer and the formation of new blood vessels at the site is inhibited. 
Within one embodiment of the invention, the anti-angiogenic compound is administered 
directly to the tumor excision site (e.g., applied by swabbing, brushing or otherwise 
coating the resection margins of the tumor with the anti-angiogenic compound). 
Alternatively, the anti-angiogenic compounds may be incorporated into known surgical 
pastes prior to administration. Within particularly preferred embodiments of the invention, 
the anti-angiogenic compounds are applied after hepatic resections for malignancy, and 
after neurosurgical operations. 

Within one aspect of the present invention, polynucleotides, polypeptides, agonists 
and/or agonists may be administered to the resection margin of a wide variety of tumore, 
including for example, breast, colon, brain and hepatic tumors. For example, within one 
embodiment of the invention, anti-angiogenic compounds may be administered to the site 
of a neurological tumor subsequent to excision, such that the formation of new blood 
vessels at the site are inhibited. 

The polynucleotides, polypeptides, agonists and/or agonists of the present invention 
may also be administered along with other anti-angiogenic factors. Representative 
examples of other anti-angiogenic factors include: Anti-Invasive Factor, retinoic acid and 
derivatives thereof, paclitaxel, Suramin, Tissue Inhibitor of Metalloproteinase-1, Tissue 
Inhibitor or Metalloproteinase-2, Plasminogen Activator Inhibitor-1, Plasminogen 
Activator Inhibitor-2, and various forms of the lighter "d group" transition metals. 
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Lighter "d group" transition metals include, for example, vanadium, molybdenum, 
tungsten, titanium, niobium, and tantalum species. Such transition metal species may form 
transition metal complexes. Suitable complexes of the above-mentioned transition metal 
species include oxo transition metal complexes. 
5 Representative examples of vanadium complexes include oxo vanadium complexes 

such as vanadate and vanadyl complexes. Suitable vanadate complexes include 
metavanadate and orthovanadate complexes such as, for example, ammonium 
metavanadate, sodium metavanadate, and sodium orthovanadate. Suitable vanadyl 
complexes include, for example, vanadyl acetylacetonate and vanadyl sulfate including 

10 vanadyl sulfate hydrates such as vanadyl sulfate mono- and trihydrates. 

Representative examples of tungsten and molybdenum complexes also include oxo 
complexes. Suitable oxo tungsten complexes include tungstate and tungsten oxide 
complexes. Suitable tungstate complexes include ammonium tungstate, calcium tungstate, 
sodium tungstate dihydrate, and tungstic acid. Suitable tungsten oxides include tungsten 

15 (IV) oxide and tungsten (VI) oxide. Suitable oxo molybdenum complexes include 
molybdate, molybdenum oxide, and molybdenyl complexes. Suitable molybdate 
complexes include ammonium molybdate and its hydrates, sodium molybdate and its 
hydrates, and potassium molybdate and its hydrates. Suitable molybdenum oxides include 
molybdenum (VI) oxide, molybdenum (VI) oxide, and molybdic acid. Suitable 

20 molybdenyl complexes include, for example, molybdenyl acetylacetonate. Other suitable 
tungsten and molybdenum complexes include hydroxo derivatives derived from, for 
example, glycerol, tartaric acid, and sugars. 

A wide variety of other anti-angiogenic factors may also be utilized within the 
context of the present invention. Representative examples include platelet factor 4; 

25 protamine sulphate; sulphated chitin derivatives (prepared from queen crab shells), 
(Murata et al., Cancer Res. 51:22-26, 1991); Sulphated Polysaccharide Peptidoglycan 
Complex (SP- PG) (the function of this compound may be enhanced by the presence of 
steroids such as estrogen, and tamoxifen citrate); Staurosporine; modulators of matrix 
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metabolism, including for example, proline analogs, cishydroxyproline, d,L-3,4- 
dehydroproline, Thiaproline, alpha,alpha-dipyridyl, aminopropionitrile fumarate; 4-propyl- 
5-(4-pyridinyI)-2(3H)-oxazolone; Methotrexate; Mitoxantrone; Heparin; Interferons; 2 
Macroglobulin-serum; ChIMP-3 (Pavloff et al., J. Bio. Chem. 267:17321-17326, 1992); 
5 Chymostatin (Tomkinson et al., Biochem J. 286:475-480, 1992); Cyclodextrin 
Tetradecasulfate; Eponemycin; Camptothecin; Fumagillin (Ingber et al., Nature 348:555- 
557, 1990); Gold Sodium Thiomalate ("GST"; Matsubara and Ziff, j. Clin. Invest. 
79:1440-1446, 1987); anticollagenase-serum; alpha2-antiplasmin (Holmes et al., J. Biol. 
-Chem. 262(4):1659-1664, 1987); Bisantrene (National Cancer Institute); Lobenzarit 
10 disodium (N-(2)-carboxyphenyl-4- chloroanthronilic acid disodium or "CCA"; Takeuchi et 
al., Agents Actions 36:312-316, 1992); Thalidomide; Angostatic steroid; AGM-1470; 
carboxynaminolmidazole; and metalloproteinase inhibitors such as BB94. 



15 



Diseases at the Cellular I » w l 

Diseases associated with increased cell survival or the inhibition of apoptosis that 
could be treated or detected by the polynucleotides or polypeptides and/or antagonists or 
agonists of the invention, include cancers (such as follicular lymphomas, carcinomas with 
P 53 mutations, and hormone-dependent tumors, including, but not limited to colon cancer, 
cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, glioblastoma, lung cancer, 
20 intestinal cancer, testicular cancer, stomach cancer, neuroblastoma, myxoma, myoma, 
lymphoma, endothelioma, osteoblastoma, osteoclastoma, osteosarcoma, chondrosarcoma, 
adenoma, breast cancer, prostate cancer, Kaposi's sarcoma and ovarian cancer); 
autoimmune disorders (such as, multiple sclerosis, Sjogren's syndrome, Hashimoto's 
thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's disease, polymyositis, systemic 
25 lupus erythematosus and immune-related glomerulonephritis and rheumatoid arthritis) and 
viral infections (such as herpes viruses, pox viruses and adenoviruses), inflammation, graft 
v. host disease, acute graft rejection, and chronic graft rejection. In preferred 
embodiments, the polynucleotides or polypeptides, and/or agonists or antagonists of the 
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invention are used to inhibit growth, progression, and/or metasis of cancers, in particular 
those listed above. 

Additional diseases or conditions associated with increased cell survival that could 
be treated or detected by the polynucleotides or polypeptides, or agonists or antagonists of 
5 the invention, include, but are not limited to, progression, and/or metastases of 
malignancies and related disorders such as leukemia (including acute leukemias (e.g., 
acute lymphocytic leukemia, acute myelocytic leukemia (including myeloblasts, 
promyelocyte, myelomonocytic, monocytic, and erythroleukemia)) and chronic leukemias 
(e.g., chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia)), 

10 polycythemia vera, lymphomas (e.g. ? Hodgkin's disease and non-Hodgkin's disease), 
multiple myeloma, Waldenstrom's macroglobulinemia, heavy chain disease, and solid 
tumors including, but not limited to, sarcomas and carcinomas such as fibrosarcoma, 
myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 

15 synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 
carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, 
medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct 

20 carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilm's tumor, cervical 
cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, 
epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, 
ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, 
menangioma, melanoma, neuroblastoma, and retinoblastoma. 

25 Diseases associated with increased apoptosis that could be treated or detected by 

the polynucleotides or polypeptides, and/or agonists or antagonists of the invention, 
include AIDS; neurodegenerative disorders (such as Alzheimer's disease, Parkinson's 
disease, Amyotrophic lateral sclerosis, Retinitis pigmentosa, Cerebellar degeneration and 
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brain tumor or prior associated disease); autoimmune disorders (such as, multiple sclerosis, 
Sjogren's syndrome, Hashimoto's thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's 
disease, polymyositis, systemic lupus erythematosus and immune-related 
glomerulonephritis and rheumatoid arthritis) myelodysplasia syndromes (such as aplastic 
anemia), graft v. host disease, ischemic injury (such as that caused by myocardial 
infarction, stroke and reperfusion injury), liver injury (e.g., hepatitis related liver injury, 
ischemia/reperfusion injury, cholestosis (bile duct injury) and liver cancer); toxin-induced 
liver disease (such as that caused by alcohol), septic shock, cachexia and anorexia. 

Wound Healing and Fpit Mial Cell P^ lif^;™ 

In accordance with yet a further aspect of the present invention, there is provided a 
process for utilizing the polynucleotides or polypeptides, and/or agonists or antagonists of 
the invention, for therapeutic purposes, for example, to stimulate epithelial cell 
proliferation and basal keratinocytes for the purpose of wound healing, and to stimulate 
hair fol licle production and healing of dermal wounds. Polynucleotides or polypeptides, as 
well as agonists or antagonists of the invention, may be clinically useful in stimulating 
wound healing including surgical wounds, excisional wounds, deep wounds involving 
damage of the dermis and epidermis, eye tissue wounds, dental tissue wounds, oral cavity 
wounds, diabetic ulcers, dermal ulcers, cubitus ulcers, arterial ulcers, venous stasis ulcers, 
.burns resulting from heat exposure or chemicals, and other abnormal wound healing 
conditions such as uremia, malnutrition, vitamin deficiencies and complications associted 
with systemic treatment with steroids, radiation therapy and antineoplastic drugs and 
antimetabolites. Polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could be used to promote dermal reestablishment subsequent to dermal loss 

The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could be used to increase the adherence of skin grafts to a wound bed and to 
stimulate re-epithelialization from the wound bed. . The following are a non-exhaustive list 
of grafts that polynucleotides or polypeptides, agonists or antagonists of the invention, 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 „^.„_ 

PCT/US99/2503I 

429 

could be used to increase adherence to a wound bed: autografts, artificial skin, allografts, 
autodermic graft, autoepdermic grafts, avacular grafts, Blair-Brown grafts, bone graft, 
brephoplastic grafts, cutis graft, delayed graft, dermic graft, epidermic graft, fascia graft, 
full thickness graft, heterologous graft, xenograft, homologous graft, hyperplastic graft, 
lamellar graft, mesh graft, mucosal graft, Ollier-Thiersch graft, omenpal graft, patch graft, 
pedicle graft, penetrating graft, split skin graft, thick split graft. The polynucleotides or 
polypeptides, and/or agonists or antagonists of the invention, can be used to promote skin 
strength and to improve the appearance of aged skin. 

It is believed that the polynucleotides or polypeptides, and/or agonists or 
antagonists of the invention, will also produce changes in hepatocyte proliferation, and 
epithelial cell proliferation in the lung, breast, pancreas, stomach, small intesting, and large 
intestine. The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could promote proliferation of epithelial cells such as sebocytes, hair follicles, 
hepatocytes, type II pneumocytes, mucin-producing goblet cells, and other epithelial cells 
and their progenitors contained within the skin, lung, liver, and gastrointestinal tract. The 
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, may 
promote proliferation of endothelial cells, keratinocytes, and basal keratinocytes. 

The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could also be used to reduce the side effects of gut toxicity that result from 
radiation, chemotherapy treatments or viral infections. The polynucleotides or 
polypeptides, and/or agonists or antagonists of the invention, may have a cytoprotective 
effect on the small intestine mucosa. The polynucleotides or polypeptides, and/or agonists 
or antagonists of the invention, may also stimulate healing of mucositis (mouth ulcers) that 
result from chemotherapy and viral infections. 

The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could further be used in full regeneration of skin in full and partial thickness 
skin defects, including burns, (i.e., repopulation of hair follicles, sweat glands, and 
sebaceous glands), treatment of other skin defects such as psoriasis. The polynucleotides or 
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polypeptides, and/or agonists or antagonists of the invention, could be used to treat 
epidermolysis bullosa, a defect in adherence of the epidermis to the underlying dermis 
which results in frequent, open and painful blisters by accelerating reepithelialization of 
these lesions. The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could also be used to treat gastric and doudenal ulcers and help heal by scar 
formation of the mucosal lining and regeneration of glandular mucosa and duodenal 
mucosal lining more rapidly. Inflamamatory bowel diseases, such as Crohn's disease and 
ulcerative colitis, are diseases which result in destruction of the mucosal surface of the 
small or large intestine, respectively. Thus, the polynucleotides or polypeptides, and/or 
agonists or antagonists of the invention, could be used to promote the resurfacing of the 
mucosal surface to aid more rapid healing and to prevent progression of inflammatory 
bowel disease. Treatment with the polynucleotides or polypeptides, and/or agonists or 
antagonists of the invention, is expected to have a significant effect on the production of 
mucus throughout the gastrointestinal tract and could be used to protect the intestinal 
mucosa from injurious substances that are ingested or following surgery. The 
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, could be 
used to treat diseases associate with the under expression of the polynucleotides of the 
invention. 

Moreover, the polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could be used to prevent and heal damage to the lungs due to various 
pathological states. A growth factor such as the polynucleotides or polypeptides, and/or 
agonists or antagonists of the invention, which could stimulate proliferation and 
differentiation and promote the repair of alveoli and brochiolar epithelium to prevent or 
treat acute or chronic lung damage. For example, emphysema, which results in the 
progressive loss of aveoli, and inhalation injuries, i.e., resulting from smoke inhalation and 
burns, that cause necrosis of the bronchiolar epithelium and alveoli could be effectively 
treated using the polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention. Also, the polynucleotides or polypeptides, and/or agonists or antagonists of the 
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invention, could be used to stimulate the proliferation of and differentiation of type II 
pneumocytes, which may help treat or prevent disease such as hyaline membrane diseases, 
such as infant respiratory distress syndrome and bronchopulmonary displasia, in premature 
infants. 

5 The polynucleotides or polypeptides, and/or agonists or antagonists of the 

invention, could stimulate the proliferation and differentiation of hepatocytes and, thus, 
could be used to alleviate or treat liver diseases and pathologies such as fulminant liver 
failure caused by cirrhosis, liver damage caused by viral hepatitis and toxic substances 
(i.e., acetaminophen, carbon tetraholoride and other hepatotoxins known in the art). 

10 In addition, the polynucleotides or polypeptides, and/or agonists or antagonists of 

the invention, could be used treat or prevent the onset of diabetes mellitus. In patients with 
newly diagnosed Types I and II diabetes, where some islet cell function remains, the 
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, could be 
used to maintain the islet function so as to alleviate, delay or prevent permanent 

15 manifestation of the disease. Also, the polynucleotides or polypeptides, and/or agonists or 
antagonists of the invention, could be used as an auxiliary in islet cell transplantation to 
improve or promote islet cell function. 

Infections Disease 

20 A polypeptide or polynucleotide and/or agonist or antagonist of the present 

invention can be used to treat or detect infectious agents. For example, by increasing the 
immune response, particularly increasing the proliferation and differentiation of B and/or T 
cells, infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response. 

25 Alternatively, polypeptide or polynucleotide and/or agonist or antagonist of the present 
invention may also directly inhibit the infectious agent, without necessarily eliciting an 
immune response. 
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Viruses are one example of an infectious agent that can cause disease or symptoms 
that can be treated or detected by a polynucleotide or polypeptide and/or agonist or 
antagonist of the present invention. Examples of viruses, include, but are not limited to 
Examples of viruses, include, but are not limited to the following DNA and RNA viruses 
and viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, Bimaviridae, 
Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Dengue, EBV, HIV, 
Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae (such as, Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza A, Influenza B, and parainfluenza), 
Papiloma virus, Papovaviridae, Parvoviridae, Picomaviridae, Poxviridae (such as 
Smallpox or Vaccinia), Reoviridae (e.g.. Rotavirus), Retroviridae (HTLV-1, HTLV-II, 
Lentivirus), and Togaviridae (e.g., Rubivirus). Viruses falling within these families can 
cause a variety of diseases or symptoms, including, but not limited to: arthritis, 
bronchiolitis, respiratory syncytial virus, encephalitis, eye infections (e.g., conjunctivitis, 
keratitis), chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta), 
Japanese B encephalitis, Junin, Chikungunya, Rift Valley fever, yellow fever, meningitis, 
opportunistic infections (e.g., AIDS), pneumonia, Burkitt's Lymphoma, chickenpox, 
hemorrhagic fever, Measles, Mumps, Parainfluenza, Rabies, the common cold, Polio, 
leukemia, Rubella, sexually transmitted diseases, skin diseases (e.g., Kaposi's, warts), and 
viremia. polynucleotides or polypeptides, or agonists or antagonists of the invention, can 
be used to treat or detect any of these symptoms or diseases. In specific embodiments, 
polynucleotides, polypeptides, or agonists or antagonists of the invention are used to treat: 
meningitis, Dengue, EBV, and/or hepatitis (e.g., hepatitis B). In an additional specific 
embodiment polynucleotides, polypeptides, or agonists or antagonists of the invention are 
used to treat patients nonresponsi ve to one or more other commercially available hepatitis 
vaccines. In a further specific embodiment polynucleotides, polypeptides, or agonists or 
antagonists of the invention are used to treat AIDS. 
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Similarly, bacterial or fungal agents that can cause disease or symptoms and that 
can be treated or detected by a polynucleotide or polypeptide and/or agonist or antagonist 
of the present invention include, but not limited to, include, but not limited to, the 
following Gram-Negative and Gram-positive bacteria and bacterial families and fungi: 
5 Actinomycetales (e.g., Corynebacterium, Mycobacterium, Norcardia), Cryptococcus 
neoformans, Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae, 
Blastomycosis, Bordetella, Borrelia (e.g., Borrelia burgdorferi, Brucellosis, Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatocycoses, E. coli (e.g., 
Enterotoxigenic E. coli and Enterohemorrhagic E. coli), Enterobacteriaceae (Klebsiella, 

10 Salmonella (e.g., Salmonella typhi, and Salmonella paratyphi), Serratia, Yersinia), 
Erysipelothrix, Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, 
Mycobacterium leprae, Vibrio cholerae, Neisseriaceae (e.g., Acinetobacter, Gonorrhea, 
Menigococcal), Meisseria meningitidis, Pasteurellacea Infections (e.g., Actinobacillus, 
Heamophilus (e.g., Heamophilus influenza type B), Pasteurella), Pseudomonas, 

15 Rickettsiaceae, Chlamydiaceae, Syphilis, Shigella spp., Staphylococcal, Meningococcal, 
Pneumococcal and Streptococcal (e.g., Streptococcus pneumoniae and Group B 
Streptococcus). These bacterial or fungal families can cause the following diseases or 
symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 

20 related infections), paronychia, prosthesis-related infections, Reiter's Disease, respiratory 
tract infections, such as Whooping Cough or Empyema, sepsis, Lyme Disease, Cat-Scratch 
Disease, Dysentery, Paratyphoid Fever, food poisoning, Typhoid, pneumonia, Gonorrhea, 
meningitis (e.g., mengitis types A and B), Chlamydia, Syphilis, Diphtheria, Leprosy, 
Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, impetigo, Rheumatic 

25 Fever, Scarlet Fever, sexually transmitted diseases, skin diseases (e.g., cellulitis, 

dermatocycoses), toxemia, urinary tract infections, wound infections. Polynucleotides or 
polypeptides, agonists or antagonists of the invention, can be used to treat or detect any of 
these symptoms or diseases. In specific embodiments, Ppolynucleotides, polypeptides, 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 „^ 

PCnyUS99/25031 

434 

agonists or antagonists of the invention are used to treat: tetanus, Diptheria, botulism, 
and/or meningitis type B. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention include, but not limited to, the following families or class: Amebiasis, 
Babesiosis, Coccidiosis, Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, 
Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, 
and Trichomonas and Sporozoans (e.g., Plasmodium virax, Plasmodium falciparum, 
Plasmodium malariae and Plasmodium ovale). These parasites can cause a variety of 
diseases or symptoms, including, but not limited to: Scabies, Trombiculiasis, eye 
infections, intestinal disease (e.g., dysentery, giardiasis), liver disease, lung disease, 
opportunistic infections (e.g., AIDS related), malaria, pregnancy complications, and 
toxoplasmosis, polynucleotides or polypeptides, or agonists or antagonists of the invention, 
can be used to treat or detect any of these symptoms or diseases. In specific embodiments, 
polynucleotides, polypeptides, or agonists or antagonists of the invention are used to treat 
malaria. 

Preferably, treatment using a polypeptide or polynucleotide and/or agonist or 
antagonist of the present invention could either be by administering an effective amount of 
a polypeptide to the patient, or by removing cells from the patient, supplying the cells with 
a polynucleotide of the present invention, and returning the engineered cells to the patient 
(ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present invention 
can be used as an antigen in a vaccine to raise an immune response against infectious 
disease. 



Regeneration 

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention can be used to differentiate, proliferate, and attract cells, leading to the 
regeneration of tissues. (See, Science 276:59-87 (1997).) The regeneration of tissues 
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could be used to repair, replace, or protect tissue damaged by congenital defects, trauma 
(wounds, burns, incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, 
periodontal disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs (e.g., 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac), 
vasculature (including vascular and lymphatics), nervous, hematopoietic, and skeletal 
(bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs without or 
decreased scarring. Regeneration also may include angiogenesis. 

Moreover, a polynucleotide or polypeptide and/or agonist or antagonist of the 
present invention may increase regeneration of tissues difficult to heal. For example, 
increased tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide and/or agonist or antagonist of the present invention could 
also be used prophylactically in an effort to avoid damage. Specific diseases that could be 
treated include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. 
A further example of tissue regeneration of non-healing wounds includes pressure ulcers, 
ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide and/or agonist or antagonist of the present invention to 
proliferate and differentiate nerve cells. Diseases that could be treated using this method 
include central and peripheral nervous system diseases, neuropathies, or mechanical and 
traumatic disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, Parkinson's 
disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome), 
could all be treated using the polynucleotide or polypeptide and/or agonist or antagonist of 
the present invention. 
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Chemotaxis 

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention may have chemotaxis activity. A chemotaxic molecule attracts or mobilizes 
cells (e.g., monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial 
and/or endothelial cells) to a particular site in the body, such as inflammation, infection, or 
site of hyperproliferation. The mobilized cells can then fight off and/or heal the particular 
trauma or abnormality. 

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention may increase chemotaxic activity of particular cells. These chemotactic 
molecules can then be used to treat inflammation, infection, hyperproliferati ve disorders, 
or any immune system disorder by increasing the number of cells targeted to a particular 
location in the body. For example, chemotaxic molecules can be used to treat wounds and 
other trauma to tissues by attracting immune cells to the injured location. Chemotactic 
molecules of the present invention can also attract fibroblasts, which can be used to treat 
wounds. 

It is also contemplated that a polynucleotide or polypeptide and/or agonist or 
antagonist of the present invention may inhibit chemotactic activity. These molecules 
could also be used to treat disorders. Thus, a polynucleotide or polypeptide and/or agonist 
,or antagonist of the present invention could be used as an inhibitor of chemotaxis. 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that 
bind to the polypeptide or for molecules to which the polypeptide binds. The binding of 
the polypeptide and the molecule may activate (agonist), increase, inhibit (antagonist), or 
decrease activity of the polypeptide or the molecule bound. Examples of such molecules 
include antibodies, oligonucleotides, proteins (e.g., receptors),or small molecules. 
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Preferably, the molecule is closely related to the natural ligand of the polypeptide, 
e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural or functional 
mimetic. (See, Coligan et al„ Current Protocols in Immunology l(2):Chapter 5 (1991).) 
Similarly, the molecule can be closely related to the natural receptor to which the 
polypeptide binds, or at least, a fragment of the receptor capable of being bound by the 
polypeptide (e.g., active site). In either case, the molecule can be rationally designed using 
known techniques. 

Preferably, the screening for these molecules involves producing appropriate cells 
which express the polypeptide, either as a secreted protein or on the cell membrane. 
Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. Cells expressing 
the polypeptide (or cell membrane containing the expressed polypeptide) are then 
preferably contacted with a test compound potentially containing the molecule to observe 
binding, stimulation, or inhibition of activity of either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, 
wherein binding is detected by a label, or in an assay involving competition with a labeled 
competitor. Further, the assay may test whether the candidate compound results in a signal 
generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate compound 
with a solution containing a polypeptide, measuring polypeptide/molecule activity or 
binding, and comparing the polypeptide/molecule activity or binding to a standard. 

Preferably, an ELISA assay can measure polypeptide level or activity in a sample 
(e.g., biological sample) using a monoclonal or polyclonal antibody. The antibody can 
measure polypeptide level or activity by either binding, directly or indirectly, to the 
polypeptide or by competing with the polypeptide for a substrate. 

Additionally, the receptor to which a polypeptide of the invention binds can be 
identified by numerous methods known to those of skill in the art, for example, ligand 
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panning and FACS sorting (Coligan, et al., Current Protocols in Immun., 1(2), Chapter 5, 
(1991)). For example, expression cloning is employed wherein polyadenylated RNA is 
prepared from a cell responsive to the polypeptides, for example, NIH3T3 cells which are 
known to contain multiple receptors for the FGF family proteins, and SC-3 cells, and a 
5 cDNA library created from this RNA is divided into pools and used to transfect COS cells 
or other cells that are not responsive to the polypeptides. Transfected cells which are 
grown on glass slides are exposed to the polypeptide of the present invention, after they 
have been labelled. The polypeptides can be labeled by a variety of means including 
iodination or inclusion of a recognition site for a site-specific protein kinase. 
10 Following fixation and incubation, the slides are subjected to auto-radiographic 

analysis. Positive pools are identified and sub-pools are prepared and re-transfected using 
an iterative sub-pooling and re-screening process, eventually yielding a single clones that 
encodes the putative receptor. 

As an alternative approach for receptor identification, the labeled polypeptides can 
15 be photoaffinity linked with cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by PAGE analysis and exposed to X- 
ray film. The labeled complex containing the receptors of the polypeptides can be excised, 
resolved into peptide fragments, and subjected to protein microsequencing. The amino 
acid sequence obtained from microsequencing would be used to design a set of degenerate 
20 oligonucleotide probes to screen a cDNA library to identify the genes encoding the 
putative receptors. 

Moreover, the techniques of gene-shuffling, motif-shuffling, exon-shuffling, and/or 
codon-shuffling (collectively referred to as "DNA shuffling") may be employed to 
modulate the activities of polypeptides of the invention thereby effectively generating 
25 agonists and antagonists of polypeptides of the invention. See generally, U.S. Patent Nos. 
5,605,793. 5,811,238, 5,830,721, 5,834,252, and 5,837,458, and Patten, P. A., et al., Curr. 
Opinion Biotechnol. 8:724-33 (1997); Harayama, S. Trends Biotechnol. 16(2):76-82 
(1998); Hansson, L. O., et al., J. Mol. Biol. 287:265-76 (1999); and Lorenzo, M. M. and 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

439 

Blasco, R. Biotechniques 24(2):308-13 (1998) (each of these patents and publications are 
hereby incorporated by reference). In one embodiment, alteration of polynucleotides and 
corresponding polypeptides of the invention may be achieved by DNA shuffling. DNA 
shuffling involves the assembly of two or more DNA segments into a desired 
5 polynucleotide sequence of the invention molecule by homologous, or site-specific, 
recombination. In another embodiment, polynucleotides and corresponding polypeptides 
of the invention may be alterred by being subjected to random mutagenesis by error-prone 
PCR, random nucleotide insertion or other methods prior to recombination. In another 
embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of 

10 the polypeptides of the invention may be recombined with one or more components, 
motifs, sections, parts, domains, fragments, etc. of one or more heterologous molecules. In 
preferred embodiments, the heterologous molecules are family members. In further 
preferred embodiments, the heterologous molecule is a growth factor such as, for example, 
platelet-derived growth factor (PDGF). insulin-like growth factor (IGF-I), transforming 

15 growth factor (TGF)-alpha, epidermal growth factor (EGF), fibroblast growth factor 
(FGF), TGF-beta, bone morphogenetic protein (BMP)-2, BMP-4, BMP-5, BMP-6, BMP-7, 
activins A and B, decapentaplegic(dpp), 60A, OP-2, dorsalin, growth differentiation 
factors (GDFs), nodal, MIS, inhibin-alpha, TGF-betal, TGF-beta2, TGF-beta3, TGF- 
beta5, and glial-derived neurotrophic factor (GDNF). 

20 Other preferred fragments are biologically active fragments of the polypeptides of 

the invention. Biologically active fragments are those exhibiting activity similar, but not 
necessarily identical, to an activity of the polypeptide. The biological activity of the 
fragments may include an improved desired activity, or a decreased undesirable activity. 

Additionally, this invention provides a method of screening compounds to identify 

25 those which modulate the action of the polypeptide of the present invention. An example 
of such an assay comprises combining a mammalian fibroblast cell, a the polypeptide of 
the present invention, the compound to be screened and 3[H] thymidine under cell culture 
conditions where the fibroblast cell would normally proliferate. A control assay may be 
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performed in the absence of the compound to be screened and compared to the amount of 
fibroblast proliferation in the presence of the compound to determine if the compound 
stimulates proliferation by determining the uptake of 3[H] thymidine in each case. The 
amount of fibroblast cell proliferation is measured by liquid scintillation chromatography 
which measures the incorporation of 3[H] thymidine. Both agonist and antagonist 
compounds may be identified by this procedure. 

In another method, a mammalian cell or membrane preparation expressing a 
receptor for a polypeptide of the present invention is incubated with a labeled polypeptide 
of the present invention in the presence of the compound. The ability of the compound to 
enhance or block this interaction could then be measured. Alternatively, the response of a 
known second messenger system following interaction of a compound to be screened and 
the receptor is measured and the ability of the compound to bind to the receptor and elicit a 
second messenger response is measured to determine if the compound is a potential agonist 
or antagonist. Such second messenger systems include but are not limited to, cAMP 
guanylate cyclase, ion channels or phosphoinositide hydrolysis. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptides of the invention from suitably manipulated 
cells or tissues. Therefore, the invention includes a method of identifying compounds 
which bind to the polypeptides of the invention comprising the steps of: (a) incubating a 
candidate binding compound with the polypeptide; and (b) determining if binding has 
occurred. Moreover, the invention includes a method of identifying agonists/antagonists 
comprising the steps of: (a) incubating a candidate compound with the polypeptide, (b) 
assaying a biological activity , and (b) determining if a biological activity of the 
polypeptide has been altered. 

Also, one could identify molecules bind a polypeptide of the invention 
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experimentally by using the beta-pleated sheet regions contained in the polypeptide 
sequence of the protein. Accordingly, specific embodiments of the invention are directed 
to polynucleotides encoding polypeptides which comprise, or alternatively consist of, the 
amino acid sequence of each beta pleated sheet regions in a disclosed polypeptide 
sequence. Additional embodiments of the invention are directed to polynucleotides 
encoding polypeptides which comprise, or alternatively consist of, any combination or all 
of contained in the polypeptide sequences of the invention. Additional preferred 
embodiments of the invention are directed to polypeptides which comprise, or alternatively 
consist of, the amino acid sequence of each of the beta pleated sheet regions in one of the 
polypeptide sequences of the invention. Additional embodiments of the invention are 
directed to polypeptides which comprise, or alternatively consist of. any combination or all 
of the beta pleated sheet regions in one of the polypeptide sequences of the invention. 

Targeted Delivery 

In another embodiment, the invention provides a method of delivering 
compositions to targeted cells expressing a receptor for a polypeptide of the invention, or 
cells expressing a cell bound form of a polypeptide of the invention. 

As discussed herein, polypeptides or antibodies of the invention may be associated 
with heterologous polypeptides, heterologous nucleic acids, toxins, or prodrugs via 
hydrophobic, hydrophilic, ionic and/or covalent interactions. 

In one embodiment, the invention provides a method for the specific delivery of 
compositions of the invention to cells by administering polypeptides of the invention 
(including antibodies) that are associated with heterologous polypeptides or nucleic acids. 
In one example, the invention provides a method for delivering a therapeutic protein into 
the targeted cell. In another example, the invention provides a method for delivering a 
single stranded nucleic acid (e.g., antisense orribozymes) or double stranded nucleic acid 
(e.g., DNA that can integrate into the cell's genome or replicate episomally and that can be 
transcribed) into the targeted cell. 
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In another embodiment, the invention provides a method for the specific 
destruction of cells (e.g., the destruction of tumor cells) by administering polypeptides of 
the invention (e.g., polypeptides of the invention or antibodies of the invention) in 
association with toxins or cytotoxic prodrugs. 

By "toxin" is meant compounds that bind and activate endogenous cytotoxic 
effector systems, radioisotopes, holotoxins, modified toxins, catalytic subunits of toxins, or 
any molecules or enzymes not normally present in or on the surface of a cell that under 
defined conditions cause the cell's death. Toxins that may be used according to the 
methods of the invention include, but are not limited to, radioisotopes known in the art, 
compounds such as, for example, antibodies (or complement fixing containing portions 
thereof) that bind an inherent or induced endogenous cytotoxic effector system, thymidine 
kinase, endonuclease, RNAse, alpha toxin, ricin, abrin, Pseudomonas exotoxin A, 
diphtheria toxin, saporin, momordin, gelonin, pokeweed antiviral protein, alpha-sarcin and 
cholera toxin. By "cytotoxic prodrug" is meant a non-toxic compound that is converted by 
an enzyme, normally present in the cell, into a cytotoxic compound. Cytotoxic prodrugs 
that may be used according to the methods of the invention include, but are not limited to, 
glutamyl derivatives of benzoic acid mustard alkylating agent, phosphate derivatives of 
etoposide or mitomycin C, cytosine arabinoside, daunorubisin, and phenoxyacetamide 
derivatives of doxorubicin. 

Drug Screening 

Further contemplated is the use of the polypeptides of the present invention, or the 
polynucleotides encoding these polypeptides, to screen for molecules which modify the 
activities of the polypeptides of the present invention. Such a method would include 
contacting the polypeptide of the present invention with a selected compound(s) suspected 
of having antagonist or agonist activity, and assaying the activity of these polypeptides 
following binding. 

This invention is particularly useful for screening therapeutic compounds by using 
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the polypeptides of the present invention, or binding fragments thereof, in any of a variety 
of drug screening techniques. The polypeptide or fragment employed in such a test may be 
affixed to a solid support, expressed on a cell surface, free in solution, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells 
which are stably transformed with recombinant nucleic acids expressing the polypeptide or 
fragment. Drugs are screened against such transformed cells in competitive binding assays. 
One may measure, for example, the formulation of complexes between the agent being 
tested and a polypeptide of the present invention. 

Thus, the present invention provides methods of screening for drugs or any other 
agents which affect activities mediated by the polypeptides of the present invention. These 
methods comprise contacting such an agent with a polypeptide of the present invention or 
a fragment thereof and assaying for the presence of a complex between the agent and the 
polypeptide or a fragment thereof, by methods well known in the art. In such a competitive 
binding assay, the agents to screen are typically labeled. Following incubation, free agent 
is separated from that present in bound form, and the amount of free or uncomplexed label 
is a measure of the ability of a particular agent to bind to the polypeptides of the present 
invention. 

Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to the polypeptides of the present invention, 
and is described in great detail in European Patent Application 84/03564, published on 
September 13, 1984, which is incorporated herein by reference herein. Briefly stated, large 
numbers of different small peptide test compounds are synthesized on a solid substrate, 
such as plastic pins or some other surface. The peptide test compounds are reacted with 
polypeptides of the present invention and washed. Bound polypeptides are then detected by 
methods well known in the art. Purified polypeptides are coated directly onto plates for use 
in the aforementioned drug screening techniques. In addition, non-neutralizing antibodies 
may be used to capture the peptide and immobilize it on the solid support. 
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This invention also contemplates the use of competitive drug screening assays in 
which neutralizing antibodies capable of binding polypeptides of the present invention 
specifically compete with a test compound for binding to the polypeptides or fragments 
thereof. In this manner, the antibodies are used to detect the presence of any peptide which 
shares one or more antigenic epitopes with a polypeptide of the invention. 

Antisense And Rihnz vme (Antaff nnktB) 

In specific embodiments, antagonists according to the present invention are nucleic 
acids corresponding to the sequences contained in SEQ ID NO:X, or the complementary 
strand thereof, and/or to nucleotide sequences contained a deposited clone. In one 
embodiment, antisense sequence is generated internally by the organism, in another 
embodiment, the antisense sequence is separately administered (see, for example, 
O'Connor, Neurochem., 56:560 (1991). Oligodeoxynucleotides as Anitsense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988). Antisense technology can be used 
to control gene expression through antisense DNA or RNA, or through triple-helix 
formation. Antisense techniques are discussed for example, in Okano, Neurochem., 
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC 
Press, Boca Raton, FL (1988). Triple helix formation is discussed in, for instance, Lee et 
al., Nucleic Acids Research, 6:3073 (1979); Cooney et al. 5 Science, 241:456 (1988); and 
Dervan et al., Science, 251:1300 (1991). The methods are based on binding of a 
polynucleotide to a complementary DNA or RNA. 

For example, the use of c-myc and c-myb antisense RNA constructs to inhibit the 
growth of the non-lymphocytic leukemia cell line HL-60 and other cell lines was 
previously described. (Wickstrom et al. (1988); Anfossi et al. (1989)). These experiments 
were performed in vitro by incubating cells with the oligoribonucleotide. A similar 
procedure for in vivo use is described in WO 91/15580. Briefly, a pair of oligonucleotides 
for a given antisense RNA is produced as follows: A sequence complimentary to the first 
15 bases of the open reading frame is flanked by an EcoRl site on the 5 end and a Hindlll 
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site on the 3 end. Next, the pair of oligonucleotides is heated at 90°C for one minute and 
then annealed in 2X ligation buffer (20mM TRIS HC1 pH 7.5, lOmM MgC12, 10MM 
dithiothreitol (DTT) and 0.2 mM ATP) and then ligated to the EcoRl/Hind III site of the 
retroviral vector PMV7 (WO 91/15580). 

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide of the present invention may be used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is 
designed to be complementary to a region of the gene involved in transcription thereby 
preventing transcription and the production of the receptor. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into receptor polypeptide. 

In one embodiment, the antisense nucleic acid of the invention is produced 
intracellular^ by transcription from an exogenous sequence. For example, a vector or a 
portion thereof, is transcribed, producing an antisense nucleic acid (RNA) of the invention. 
Such a vector would contain a sequence encoding the antisense nucleic acid of the 
invention. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in vertebrate 
cells. Expression of the sequence encoding a polypeptide of the invention, or fragments 
thereof, can be by any promoter known in the art to act in vertebrate, preferably human 
cells. Such promoters can be inducible or constitutive. Such promoters include, but are 
not limited to, the SV40 early promoter region (Bernoist and Chambon, Nature, 29:304- 
310 (1981), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Cell, 22:787-797 (1980), the herpes thymidine promoter (Wagner et al., 
Proc. Natl Acad. Sci. U.S.A., 78:1441-1445 (1981), the regulatory sequences of the 
metal Iothionein gene (Brinster et al., Nature, 296:39-42 (1982)), etc. 
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The antisense nucleic acids of the invention comprise a sequence complementary to 
at least a portion of an RNA transcript of a gene of interest. However, absolute 
complementarity, although preferred, is not required. A sequence "complementary to at 
least a portion of an RNA," referred to herein, means a sequence having sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case 
of double stranded antisense nucleic acids of the invention, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to hybridize 
will depend on both the degree of complementarity and the length of the antisense nucleic 
acid Generally, the larger the hybridizing nucleic acid, the more base mismatches with a 
RNA sequence of the invention it may contain and still form a stable duplex (of triplex as 
the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by 
use of standard procedures to determine the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences complementary to the 3' 
untranslated sequences of mRNAs have been shown to be effective at inhibiting translation 
ofmRNAsaswell. See generally, Wagner, R., Nature, 372:333-335(1994). Thus, 
oligonucleotides complementary to either the 5' - or 3' - non- translated, non-coding 
regions of a polynucleotide sequence of the invention could be used in an antisense 
approach to inhibit translation of endogenous mRNA. Oligonucleotides complementary to 
the 5' untranslated region of the mRNA should include the complement of the AUG start 
codon. Antisense oligonucleotides complementary to mRNA coding regions are less 
efficient inhibitors of translation but could be used in accordance with the invention. 
Whether designed to hybridize to the 5' -, 3' - or coding region of mRNA, antisense 
nucleic acids should be at least six nucleotides in length, and are preferably 
oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects the 
oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 nucleotides or 
at least 50 nucleotides. 
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The polynucleotides of the invention can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof, single-stranded or double-stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone, 
for example, to improve stability of the molecule, hybridization, etc. The oligonucleotide 
may include other appended groups such as peptides (e.g., for targeting host cell receptors 
in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 
Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556 (1989); Lemaitreet al., Proc. Natl. Acad. Sci., 
84:648-652 (1987); PCT Publication NO: WO88/09810, published December 15, 1988) or 
the blood-brain barrier (see, e.g., PCT Publication NO: WO89/10134, published April 25, 
1988), hybridization-triggered cleavage agents. (See, e.g., Krol et al., BioTechniques, 
6:958-976 (1988)) or intercalating agents. (See, e.g., Zon, Pharm. Res., 5:539-549 
(1988)). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, hybridization triggered cross-linking agent, transport agent, hybridization- 
triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety 
which is selected from the group including, but not limited to, 5-fluorouracil, 
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 
5-(carboxyhydroxylrnethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D- m annosylqueosine,5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyI) uracil, (acp3)w, and 2,6-diaminopurine. 
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The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 2-fIuoroarabinose, 

xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group including, but not limited to, a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 
oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids 
with complementary RNA in which, contrary to the usual b-units, the strands run parallel 
to each other (Gautier et al., Nucl. Acids Res., 15:6625-6641 (1987)). The oligonucleotide 
is a 2-0-methylribonucleotide (Inoue et al., Nucl. Acids Res., 15:6131-6148 (1987)), or a 
chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215:327-330 (1987)). 

Polynucleotides of the invention may be synthesized by standard methods known in 
the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate 
oligonucleotides may be synthesized by the method of Stein et al. (Nucl. Acids Res., 
16:3209 (1988)), methylphosphonate oligonucleotides can be prepared by use of controlled 
pore glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A., 85:7448-745 1 
(1988)), etc. 

While antisense nucleotides complementary to the coding region sequence of the 
invention could be used, those complementary to the transcribed untranslated region are 

most preferred. 

Potential antagonists according to the invention also include catalytic RNA, or a 
ribozyme (See, e.g., PCT International Publication WO 90/1 1364, published October 4, 
1990; Sarver et al, Science, 247:1222-1225 (1990). While ribozymes that cleave mRNA at 
site specific recognition sequences can be used to destroy mRNAs corresponding to the 
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polynucleotides of the invention, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5' -UG-3' . The construction and 
5 production of hammerhead ribozymes is well known in the art and is described more fully 
in Haseloff and Gerlach, Nature, 334:585-591 (1988). There are numerous potential 
hammerhead ribozyme cleavage sites within each nucleotide sequence disclosed in the 
sequence listing. Preferably, the ribozyme is engineered so that the cleavage recognition 
site is located near the 5' end of the mRNA corresponding to the polynucleotides of the 

10 invention; i.e., to increase efficiency and minimize the intracellular accumulation of non- 
functional mRNA transcripts. 

As in the antisense approach, the ribozymes of the invention can be composed of 
modified oligonucleotides (e.g. for improved stability, targeting, etc.) and should be 
delivered to cells which express the polynucleotides of the invention in vivo. DNA 

15 constructs encoding the ribozyme may be introduced into the cell in the same manner as 
described above for the introduction of antisense encoding DNA. A preferred method of 
delivery involves using a DNA construct "encoding" the ribozyme under the control of a 
strong constitutive promoter, such as, for example, pol III or pol II promoter, so that 
transfected cells will produce sufficient quantities of the ribozyme to destroy endogenous 

20 messages and inhibit translation. Since ribozymes unlike antisense molecules, are 
catalytic, a lower intracellular concentration is required for efficiency. 

Antagonist/agonist compounds may be employed to inhibit the cell growth and 
proliferation effects of the polypeptides of the present invention on neoplastic cells and 
tissues, i.e. stimulation of angiogenesis of tumors, and, therefore, retard or prevent 

25 abnormal cellular growth and proliferation, for example, in tumor formation or growth. 

The antagonist/agonist may also be employed to prevent hyper-vascular diseases, 
and prevent the proliferation of epithelial lens cells after extracapsular cataract surgery. 
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Prevention of the mitogenic activity of the polypeptides of the present invention may also 
be desirous in cases such as restenosis after balloon angioplasty. 

The antagonist/agonist may also be employed to prevent the growth of scar tissue 
during wound healing. 

The antagonist/agonist may also be employed to treat the diseases described herein. 
Thus, the invention provides a method of treating disorders or diseases, including but not 
limited to the disorders or diseases listed throughout this application, associated with 
oppression of a polynucleotide of the present invention by administering to a patient 

(a) an antisense molecule directed to the polynucleotide of the present invention, and/or 

(b) a ribozyme directed to the polynucleotide of the present invention 
Other Activitie s 

The polypeptide of the present invention, as a result of the ability to stimulate 
vascular endothelial cell growth, may be employed in treatment for stimulating re- 
vascularization of ischemic tissues due to various disease conditions such as thrombosis, 
arteriosclerosis, and other cardiovascular conditions. These polypeptide may also be 
employed to stimulate angiogenesis and limb regeneration, as discussed above. 

The polypeptide may also be employed for treating wounds due to injuries, bums, 
post-operative tissue repair, and ulcers since they are mitogenic to various cells of different 
origins, such as fibroblast cells and skeletal muscle cells, and therefore, facilitate the repair 
or replacement of damaged or diseased tissue. 

The polypeptide of the present invention may also be employed stimulate neuronal 
growth and to treat and prevent neuronal damage which occurs in certain neuronal 
disorders or neuro-degenerative conditions such as Alzheimer's disease, Parkinson's 
disease, and AIDS-related complex. The polypeptide of the invention may have the ability 
to stimulate chondrocyte growth, therefore, they may be employed to enhance bone and 
periodontal regeneration and aid in tissue transplants or bone grafts. 

The polypeptide of the present invention may be also be employed to prevent skin 
aging due to sunburn by stimulating keratinocyte growth. 
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The polypeptide of the invention may also be employed for preventing hair loss, 
since FGF family members activate hair-forming cells and promotes melanocyte growth. 
Along the same lines, the polypeptides of the present invention may be employed to 
stimulate growth and differentiation of hematopoietic cells and bone marrow cells when 
5 used in combination with other cytokines. 

The polypeptide of the invention may also be employed to maintain organs before 
transplantation or for supporting cell culture of primary tissues. 

The polypeptide of the present invention may also be employed for inducing tissue 
of mesodermal origin to differentiate in early embryos. 
10 The polypeptide or polynucleotides and/or agonist or antagonists of the present 

invention may also increase or decrease the differentiation or proliferation of embryonic 
stem cells, besides, as discussed above, hematopoietic lineage. 

The polypeptide or polynucleotides and/or agonist or antagonists of the present 
invention may also be used to modulate mammalian characteristics, such as body height, 
15 weight, hair color, eye color, skin, percentage of adipose tissue, pigmentation, size, and 
shape (e.g., cosmetic surgery). Similarly, polypeptides or polynucleotides and/or agonist 
or antagonists of the present invention may be used to modulate mammalian metabolism 
affecting catabolism, anabolism, processing, utilization, and storage of energy. 

Polypeptide or polynucleotides and/or agonist or antagonists of the present 
20 invention may be used to change a mammal's mental state or physical state by influencing 
biorhythms, caricadic rhythms, depression (including depressive disorders), tendency for 
violence, tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-like 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive 
qualities. 

25 Polypeptide or polynucleotides and/or agonist or antagonists of the present 

invention may also be used as a food additive or preservative, such as to increase or 
decrease storage capabilities, fat content, lipid, protein, carbohydrate, vitamins, minerals, 
cofactors or other nutritional components. 
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Other Preferred Rmhndim>n^ 

Other preferred embodiments of the claimed invention include an isolated nucleic 
acid molecule comprising a nucleotide sequence which is at least 95% identical to a 
sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table XIII. 

_ Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Clone Sequence and ending with the nucleotide at about the position of the 3' Nucleotide 
of the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of 
the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

Similarly preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the 
position of the 3 ' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 



xm. 



Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 
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A further preferred embodiment is a nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to the nucleotide sequence of SEQ ID NO:X 
beginning with the nucleotide at about the position of the 5 ' Nucleotide of the First Amino 
Acid of the Signal Peptide and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the complete nucleotide sequence of 
SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions to a 
nucleic acid molecule having a nucleotide sequence consisting of only A residues or of 
only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table XIII, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table XHI for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in the nucleotide sequence of a human cDNA clone identified by a cDNA Clone Identifier 
in Table XIII, which DNA molecule is contained in the deposit given the ATCC Deposit 
Number shown in Table XIII. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone. 
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Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous nucleotides 
in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to sequence of at least 500 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the complete nucleotide sequence 
encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample a 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical to 
a sequence of at least 50 contiguous nucleotides in a sequence selected from the group 
consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined 
in Table XIII; and a nucleotide sequence encoded by a human cDNA clone identified by a 
cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC Deposit 
Number shown for said cDNA clone in Table XIII; which method comprises a step of 
comparing a nucleotide sequence of at least one nucleic acid molecule in said sample with 
a sequence selected from said group and determining whether the sequence of said nucleic 
acid molecule in said sample is at least 95% identical to said selected sequence. 

. Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined from 
a nucleic acid molecule in said sample with said sequence selected from said group. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or 
cell type of a biological sample which method comprises a step of detecting nucleic acid 
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molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer 
as defined in Table XIII; and a nucleotide sequence encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide sequence 
in a panel of at least two nucleotide sequences, wherein at least one sequence in said panel 
is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table XIII, which method comprises a step of detecting in a biological sample 
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide 
sequence that is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table XIII; and a nucleotide sequence 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
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panel of at least two nucleotide sequences, wherein at least one sequence in said panel is at 
least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X 
is any integer as defined in Table XIII; and a nucleotide sequence encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table XIII and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table XIII. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino acids is 
included in the amino acid sequence of SEQ ID NO: Y in the range of positions beginning 
with the residue at about the position of the First Amino Acid of the Secreted Portion and 
ending with the residue at about the Last Amino Acid of the Open Reading Frame as set 
forth for SEQ ID NO: Y in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the amino 
acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the amino 
acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the complete amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

SUBSTITUTE SHEET (RULE 26) 



W0 00/29435 PCT/US99/25031 

457 

Also preferred is a polypeptide wherein said sequence of contiguous amino acids is 
included in the amino acid sequence of a secreted portion of the secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the amino 
acid sequence of the secreted portion of the protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the amino 
acid sequence of the secreted portion of the protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Further preferred is an isolated antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
complete amino acid sequence of a protein encoded by a human cDNA clone identified by 
a cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII. 
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Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
5 complete amino acid sequence of a protein encoded by a human cDNA clone identified by 
a cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII; which method comprises a 
step of comparing an amino acid sequence of at least one polypeptide molecule in said 
sample with a sequence selected from said group and determining whether the sequence of 
10 said polypeptide molecule in said sample is at least 90% identical to said sequence of at 
least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino acid 
sequence of at least one polypeptide molecule in said sample with a sequence selected 
from said group comprises determining the extent of specific binding of polypeptides in 
15 said sample to an antibody which binds specifically to a polypeptide comprising an amino 
acid sequence that is at least 90% identical to a sequence of at least 10 contiguous amino 
acids in a sequence selected from the group consisting of: an amino acid sequence of SEQ 
ID NO: Y wherein Y is any integer as defined in Table XIII; and a complete amino acid 
sequence of a protein encoded by a human cDNA clone identified by a cDNA Clone 
20 Identifier in Table XIII and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table XIII. 

Also preferred is the above method wherein said step of comparing sequences is 
performed by comparing the amino acid sequence determined from a polypeptide molecule 
in said sample with said sequence selected from said group. 
25 Also preferred is a method for identifying the species, tissue or cell type of a 

biological sample which method comprises a step of detecting polypeptide molecules 
said sample, if any, comprising an amino acid sequence that is at least 90% identical to ; 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
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consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table XIII; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XHI and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is the above method for identifying the species, tissue or cell type of 
a biological sample, which method comprises a step of detecting polypeptide molecules 
comprising an amino acid sequence in a panel of at least two amino acid sequences, 
wherein at least one sequence in said panel is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the above group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table XIII, which method comprises a step of detecting in a biological sample 
obtained from said subject polypeptide molecules comprising an amino acid sequence in a 
panel of at least two amino acid sequences, wherein at least one sequence in said panel is 
at least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y 
is any integer as defined in Table XIII; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 
XIII and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table XIII. 

In any of these methods, the step of detecting said polypeptide molecules includes 
using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a polypeptide 
wherein said polypeptide comprises an amino acid sequence that is at least 90% identical 
to a sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
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defined in Table XIII; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide in 
a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino acid 
sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Further preferred is a method of making a recombinant vector comprising inserting 
any of the above isolated nucleic acid molecule into a vector. Also preferred is the 
recombinant vector produced by this method. Also preferred is a method of making a 
recombinant host cell comprising introducing the vector into a host cell, as well as the 
recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising culturing 
this recombinant host cell under conditions such that said polypeptide is expressed and 
recovering said polypeptide. Also preferred is this method of making an isolated 
polypeptide, wherein said recombinant host cell is a eukaryotic cell and said polypeptide is 
a secreted portion of a human secreted protein comprising an amino acid sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO:Y beginning with the 
residue at the position of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y 
wherein Y is an integer set forth in Table XIII and said position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO: Y is defined in Table XIII; and an amino acid 
sequence of a secreted portion of a protein encoded by a human cDNA clone identified by 
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acDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII. The isolated polypeptide 
produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated polypeptide, 
polynucleotide, or antibody of the claimed invention effective to increase the level of said 
protein activity in said individual. 

The above-recited applications have uses in a wide variety of hosts. Such hosts 
include, but are not limited to, human, murine, rabbit, goat, guinea pig, camel, horse, 
mouse, rat, hamster, pig, micro-pig, chicken, goat, cow, sheep, dog, cat, non-human 
primate, and human. In specific embodiments, the host is a mouse, rabbit, goat, guinea 
pig, chicken, rat, hamster, pig, sheep, dog or cat. In preferred embodiments, the host is a 
mammal. In most preferred embodiments, the host is a human. 

Having generally described the invention, the same will be more readily understood 
by reference to the following examples, which are provided by way of illustration and are 
not intended as limiting. 



Examples 

Example 1: Nation of a Selected cDNA Clon* F rom the Opp n^it^ 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. Table 
XIII identifies the vectors used to construct the cDNA library from which each clone was 
isolated. In many cases, the vector used to construct the library is a phage vector from 
which a plasmid has been excised. The table immediately below correlates the related 
plasmid for each phage vector used in constructing the cDNA library. For example, where 
a particular clone is identified in Table XIII as being isolated in the vector "Lambda Zap," 
the corresponding deposited clone is in "pBluescript." 
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iSSjxR pBluescript(pBS) 
ZapiS Prescript ( P BS) 

n P r^ P °^S PCMVSport 2.0 

K. POrt30 pCMVSport3.0 
pCR 2.1 pCR®2.1 

Vector Lambda Zap (U.S. Patent Nos. 5.128,256 and 5,286,636), Uni-Zap XR 
~ (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 
- 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 16:7583-7600 
(1988); Alting-Mees, M. A. and Short, J. M, Nucleic Acids Res. 17:9494 (1989)) and pBK 
(Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are commercially available from 
Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey Pines Road, La Jolla, CA, 92037. pBS 
contains an ampicillin resistance gene and pBK contains a neomycin resistance gene. Both 
can be transformed into E. coli strain XL-1 Blue, also available from Stratagene. pBS 
comes in 4 forms SK + , SK-, KS + and KS. The S and K refers to the orientation of the 
polylinker to the T7 and T3 primer sequences which flank the polylinker region ("S" is for 
Sad and "K" is for Kpnl which are the first sites on each respective end of the linker). V 
or --" refer to the orientation of the fl origin of replication fori"), such that in one 
orientation, single stranded rescue initiated from the f 1 ori generates sense strand DNA and 
in the other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from Life 
Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain 
an ampicillin resistance gene and may be transformed into E. coli strain DH10B, also 
available from Life Technologies. (See, for instance, Gruber, C. E., et al., Focus 15:59 
(1993).) Vector lafmid BA (Bento Soares, Columbia University, NY) contains an 
ampicillin resistance gene and can be transformed into E. coli strain XL-1 Blue. Vector 
pCR®2.1. which is available from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 92008, 
contains an ampicillin resistance gene and may be transformed into E. coli strain DH10B, 
available from Life Technologies. (See, for instance, Clark, J. M., Nuc. Acids Res. 
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16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9: (1991).) Preferably, a 
polynucleotide of the present invention does not comprise the phage vector sequences 
identified for the particular clone in Table XTH, as well as the corresponding plasmid 
vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number cited in 
Table XIII for any given cDNA clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing the 
same ATCC Deposit Number contain at least a plasmid for each cDNA clone identified in 
Table XIII. Typically, each ATCC deposit sample cited in Table XIII comprises a mixture 
of approximately equal amounts (by weight) of about 50 plasmid DNAs, each containing a 
different cDNA clone; but such a deposit sample may include plasmids for more or less 
than 50 cDNA clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited sample 
of plasmid DNAs cited for that clone in Table XIII. First, a plasmid is directly isolated by 
screening the clones using a polynucleotide probe corresponding to SEQ ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized using 
an Applied Biosystems DNA synthesizer according to the sequence reported. The 
oligonucleotide is labeled, for instance, with "P- Y -ATP using T4 polynucleotide kinase and 
purified according to routine methods. (E.g., Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The plasmid 
mixture is transformed into a suitable host, as indicated above (such as XL-1 Blue 
(Stratagene)) using techniques known to those of skill in the art, such as those provided by 
the vector supplier or in related publications or patents cited above. The transformants are 
plated on 1 .5% agar plates (containing the appropriate selection agent, e.g., ampicillin) to a 
density of about 150 transformants (colonies) per plate. These plates are screened using 
Nylon membranes according to routine methods for bacterial colony screening (e.g., 
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold Spring 
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Harbor Laboratory Press, pages 1.93 to 1 . 104), or other techniques known to those of skill 
in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the SEQ 
ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 3' NT 
of the clone defined in Table XIII) are synthesized and used to amplify the desired cDNA 
using the deposited cDNA plasmid as a template. The polymerase chain reaction is carried 
out under routine conditions, for instance, in 25 ul of reaction mixture with 0.5 ug of the 
above cDNA template. A convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) 
gelatin, 20 uM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit 
of Taq polymerase. Thirty five cycles of PCR (denaturation at 94 degree C for 1 min; 
annealing at 55 degree C for 1 min; elongation at 72 degree C for 1 min) are performed 
with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed 
by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods include 
but are not limited to, filter probing, clone enrichment using specific probes, and protocols 
similar or identical to 5' and 3' "RACE" protocols which are well known in the art. For 
instance, a method similar to 5' RACE is available for generating the missing 5' end of a 
desired full-length transcript. (Fromont-Racine et al., Nucleic Acids Res. 21(7): 1683-1684 
(1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population of 
RNA presumably containing full-length gene RNA transcripts. A primer set containing a 
primer specific to the ligated RNA oligonucleotide and a primer specific to a known 
sequence of the gene of interest is used to PCR amplify the 5' portion of the desired full- 
length gene. This amplified product may then be sequenced and used to generate the full 
length gene. 
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This above method starts with total RNA isolated from the desired source, although 
poly-A+ RNA can be used. The RNA preparation can then be treated with phosphatase if 
necessary to eliminate 5' phosphate groups on degraded or damaged RNA which may 
interfere with the later RNA ligase step. The phosphatase should then be inactivated and 
5 the RNA treated with tobacco acid pyrophosphatase in order to remove the cap structure 
present at the 5' ends of messenger RNAs. This reaction leaves a 5' phosphate group at 
the 5' end of the cap cleaved RNA which can then be iigated to an RNA oligonucleotide 
using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
10 synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is used 
as a template for PCR amplification of the desired 5' end using a primer specific to the 
Iigated RNA oligonucleotide and a primer specific to the known sequence of the gene of 
interest. The resultant product is then sequenced and analyzed to confirm that the 5' end 
sequence belongs to the desired gene. 



15 



20 



Example 2: Isolation of Genomic Cl o nes Corresponding to a P olynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR using 
primers selected for the cDNA sequence corresponding to SEQ ID NO:X., according to the 
method described in Example 1. (See also, Sambrook.) 



Example 3; Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, among 
others, Sambrook et al. For example, a cDNA probe produced by the method described in 
25 Example 1 is labeled with P 32 using the rediprime™ DNA labeling system (Amersham 
Life Science), according to manufacturer's instructions. After labeling, the probe is 
purified using CHROMA SPIN-100™ column (Clontech Laboratories, Inc.), according to 
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manufacturer's protocol number PTI200-1. The purified labeled probe is then used to 
examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe using 
ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol 
number PT1 190-1. Following hybridization and washing, the blots are mounted and 
exposed to film at -70 degree C overnight, and the films developed according to standard 
procedures. 

Example 4; Chromosomal Mappii^ 0 f the Pniy^ .ngAtMoc 

An oligonucleotide primer set is designed according to the sequence at the 5' end of 
SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This primer set is 
then used in a polymerase chain reaction under the following set of conditions : 30 
seconds,95 degree C; 1 minute, 56 degree C; 1 minute, 70 degree C. This cycle is repeated 
32 times followed by one 5 minute cycle at 70 degree C. Human, mouse, and hamster 
DNA is used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions is analyzed on either 
8% polyacrylamide gels or 3.5 % agarose gels. Chromosome mapping is determined by 
the presence of an approximately 100 bp PCR fragment in the particular somatic cell 
hybrid. 

Example S: Bacterial Fy pressinn nf a Poly peptjHp 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers used to 
amplify the cDNA insert should preferably contain restriction sites, such as BamHl and 
Xbal, at the 5' end of the primers in order to clone the amplified product into the 
expression vector. For example, BarnHJ and Xbal correspond to the restriction enzyme 
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sites on the bacterial expression vector pQE-9. (Qiagen, Inc., Chatsworth, CA). This 
plasmid vector encodes antibiotic resistance (AmpO, a bacterial origin of replication (ori), 
an IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 6-histidine 
tag (6-His), and restriction enzyme cloning sites. 

The pQE-9 vector is digested with BamHI and Xbal and the amplified fragment is 
ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial RBS. 
The ligation mixture is then used to transform the E. coli strain M15/rep4 (Qiagen, Inc.) 
which contains multiple copies of the plasmid pREP4, which expresses the lad repressor 
and also confers kanamycin resistance (Kan*-). Transformants are identified by their ability 
to grow on LB plates and ampicillin/kanamycin resistant colonies are selected. Plasmid 
DNA is isolated and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). The 
O/N culture is used to inoculate a large culture at a ratio of 1: 100 to 1:250. The cells are 
grown to an optical density 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG (Isopropyl-B-D- 
thiogalacto pyranoside) is then added to a final concentration of 1 mM. IPTG induces by 
inactivating the lad repressor, clearing the P/O leading to increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic agent 6 
Molar Guanidine HC1 by stirring for 3-4 hours at 4 degree C. The cell debris is removed 
by centrifugation, and the supernatant containing the polypeptide is loaded onto a nickel- 
nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from QIAGEN, Inc., 
supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity and can be 
purified in a simple one-step procedure (for details see: The QIAexpressionist (1995) 
QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCI, pH 8, the 
column is first washed with 10 volumes of 6 M guanidine-HCI, pH 8, then washed with 10 
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volumes of 6 M guanidine-HCI pH 6, and finally the polypeptide is eluted with 6 M 
guanidine-HCI, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCI. Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-IM urea gradient in 
500 mM NaCI, 20% glycerol, 20 mM Tris/HCI pH 7.4, containing protease inhibitors. The 
renaturation should be performed over a period of 1 .5 hours or more. After renaturation 
the proteins are eluted by the addition of 250 mM immidazole. Immidazole is removed by 
a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer plus 200 mM 
NaCI. The purified protein is stored at 4 degree C or frozen at -80 degree C. 

In addition to the above expression vector, the present invention further includes an 
expression vector comprising phage operator and promoter elements operati vely linked to 
a polynucleotide of the present invention, called pHE4a. (ATCC Accession Number 
209645, deposited on February 25, 1998.) This vector contains: 1) a 
neomycinphosphotransferase gene as a selection marker, 2) an E. coli origin of replication, 
3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a Shine-Delgarno 
sequence, and 6) the lactose operon repressor gene (laclq). The origin of replication (oriC) 
is derived from pUC19 (LTI, Gaithersburg, MD). The promoter sequence and operator 
sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and Xbal, 
BamHI, Xhol, or Asp718, running the restricted product on a gel, and isolating the larger 
fragment (the stuffer fragment should be about 310 base pairs). The DNA insert is 
generated according to the PCR protocol described in Example 1, using PCR primers 
having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or Asp718 (3' 
primer). The PCR insert is gel purified and restricted with compatible enzymes. The 
insert and vector are ligated according to standard protocols. 
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The engineered vector could easily be substituted in the above protocol to express 
protein in a bacterial system. 

Example 6; Purification of a Polypep tid e from an Inclusion Bnriv 

The following alternative method can be used to purify a polypeptide expressed in 
E coli when it is present in the form of inclusion bodies. Unless otherwise specified, all of 
the following steps are conducted at 4-10 degree C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10 degree C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount of 
cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM 
EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear 
mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA; pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCI) for 2-4 hours. After 7000 xg centrifugation for 15 min., the pellet 
is discarded and the polypeptide containing supernatant is incubated at 4 degree C 
overnight to allow further GuHCI extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, the 
GuHCI solubilized protein is refolded by quickly mixing the GuHCI extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4 degree C without 
mixing for 12 hours prior to further purification steps. 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 

PCT/US99/2503I 

470 

To clarify the refolded polypeptide solution, a previously prepared tangential 
nitration unit equipped with 0.16 um membrane filter with appropriate surface area (e.g., 
Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered 
sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). 
5 The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 
mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise manner. The 
absorbance at 280 nm of the effluent is continuously monitored. Fractions are collected 
and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes of 
10 water. The diluted sample is then loaded onto a previously prepared set of tandem 

columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros 
CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 
mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 
6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column volume linear 
i gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM 
sodium acetate, pH 6.5. Fractions are collected under constant A, monitoring of the 
effluent. Fractions containing the polypeptide (determined, for instance, by 16% SDS- 
PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands should be observed from 
Commassie blue stained 16% SDS-PAGE gel when 5 ug of purified protein is loaded. The 
purified protein can also be tested for endotoxin/LPS contamination, and typically the LPS 
content is less than 0.1 ng/ml according to LAL assays. 

Example 7; Qoninp an d FWrfam of » t a Rar „i nviri ,, 

System 

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 
into a baculovirus to express a polypeptide. This expression vector contains the strong 
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polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus (AcMNPV) 
followed by convenient restriction sites such as BamHI, Xba I and Asp718. The 
polyadenylation site of the simian virus 40 ("SV40" ) is used for efficient polyadenylation. 
For easy selection of recombinant virus, the plasmid contains the beta-galactosidase gene 
from E. coli under control of a weak Drosophila promoter in the same orientation, 
followed by the polyadenylation signal of the polyhedrin gene. The inserted genes are 
flanked on both sides by viral sequences for cell-mediated homologous recombination with 
wild-type viral DNA to generate a viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such as 
P Ac373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, as long 
as the construct provides appropriately located signals for transcription, translation, 
secretion and the like, including a signal peptide and an in-frame AUG as required. Such 
vectors are described, for instance, in Luckow et al., Virology 170:31-39 (1989). 

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table XIII, 
is amplified using the PCR protocol described in Example 1. If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Summers et al., "A 
Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," Texas 
Agricultural Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and optionally, 
can be dephosphorylated using calf intestinal phosphatase, using routine procedures known 
in the art. The DNA is then isolated from a 1% agarose gel using a commercially available 
kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 
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The fragment and the dephosphorylated plasmid are ligated together with T4 DNA 
ligase. E. coli HB 101 or other suitable E. coli hosts such as XL- 1 Blue (Stratagene 
Cloning Systems, La Jolla, CA) cells are transformed with the ligation mixture and spread 
on culture plates. Bacteria containing the plasmid are identified by digesting DNA from 
individual colonies and analyzing the digestion product by gel electrophoresis. The 
sequence of the cloned fragment is confirmed by DNA sequencing. 

Five ug of a plasmid containing the polynucleotide is co-transfected with 1.0 ug of 
a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus DNA", 
Pharmingen, San Diego, CA), using the lipofection method described by Feigner et al., 
Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ug of BaculoGold™ virus DNA 
and 5 ug of the plasmid are mixed in a sterile well of a microliter plate containing 50 ul of 
serum-free Grace's medium (Life Technologies Inc., Gaithersburg, MD). Afterwards, 10 
ul Lipofectin plus 90 ul Grace's medium are added, mixed and incubated for 15 minutes at 
room temperature. Then the transfection mixture is added drop-wise to Sf9 insect cells 
(ATCC CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's medium 
without serum. The plate is then incubated for 5 hours at 27 degrees C. The transfection 
solution is then removed from the plate and 1 ml of Grace's insect medium supplemented 
with 10% fetal calf serum is added. Cultivation is then continued at 27 degrees C for four 
days. 

After four days the supernatant is collected and a plaque assay is performed, as 
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of gal- 
expressing clones, which produce blue-stained plaques. (A detailed description of a 
"plaque assay" of this type can also be found in the user's guide for insect cell culture and 
baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) After 
appropriate incubation, blue stained plaques are picked with the tip of a micropipettor (e.g., 
Eppendorf). The agar containing the recombinant viruses is then resuspended in a 
microcentrifuge tube containing 200 ul of Grace's medium and the suspension containing 
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the recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. Four days 
later the supernatants of these culture dishes are harvested and then they are stored at 4 
degree C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's medium 
supplemented with 10% heat-inactivated FBS. The cells are infected with the recombinant 
baculovirus containing the polynucleotide at a multiplicity of infection ("MOI") of about 2. 
If radiolabeled proteins are desired, 6 hours later the medium is removed and is replaced 
with SF900 II medium minus methionine and cysteine (available from Life Technologies 
Inc., Rockville, MD). After 42 hours, 5 uCi of 55 S -methionine and 5 uCi ,5 S-cysteine 
(available from Amersham) are added. The cells are further incubated for 16 hours and 
then are harvested by centrifugation. The proteins in the supernatant as well as the 
intracellular proteins are analyzed by SDS-PAGE followed by autoradiography (if 
radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced protein. 



Example 8: Expression of a Polypep tide in Mammalian f>|ic 

The polypeptide of the present invention can be expressed in a mammalian cell. A 
typical mammalian expression vector contains a promoter element, which mediates the 
initiation of transcription of mRNA, a protein coding sequence, and signals required for the 
termination of transcription and polyadenylation of the transcript. Additional elements 
include enhancers, Kozak sequences and intervening sequences flanked by donor and 
acceptor sites for RNA splicing. Highly efficient transcription is achieved with the early 
and late promoters from SV40, the long terminal repeats (LTRs) from Retroviruses, e.g., 
RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus (CMV). However, 
cellular elements can also be used (e.g., the human actin promoter). 
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Suitable expression vectors for use in practicing the present invention include, for 
example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat 
(ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), pCMVSport 2.0, 
and pCMVSport 3.0. Mammalian host cells that could be used include, human Hela, 293, 
H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 
cells, mouse L cells and Chinese hamster ovary (CHO) cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing the 
polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation of 
the transfected cells. 

The transfected gene can also be amplified to express large amounts of the encoded 
protein. The DHFR (dihydrofolate reductase) marker is useful in developing cell lines that 
carry several hundred or even several thousand copies of the gene of interest. (See, e.g.. 
Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J. L. and Ma, C, 
Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and Sydenham, M. A., 
Biotechnology 9:64-68 (1991).) Another useful selection marker is the enzyme glutamine 
synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991); Bebbington et al., 
Bio/Technology 10:169-175 (1992). Using these markers, the mammalian cells are grown 
in selective medium and the cells with the highest resistance are selected. These cell lines 
contain the amplified gene(s) integrated into a chromosome. Chinese hamster ovary 
(CHO) and NSO cells are often used for the production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et al., 
Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the CMV- 
enhancer (Boshart et al., Cell 41:521-530 (1985).) Multiple cloning sites, e.g., with the 
restriction enzyme cleavage sites BamHl, Xbal and Asp71 8, facilitate the cloning of the 
gene of interest. The vectors also contain the 3' intron, the polyadenylation and 
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termination signal of the rat preproinsulin gene, and the mouse DHFR gene under control 
of the S V40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate restriction 
enzymes and then dephosphorylated using calf intestinal phosphates by procedures known 
in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the protocol 
outlined in Example 1 . If the naturally occurring signal sequence is used to produce the 
secreted protein, the vector does not need a second signal peptide. Alternatively, if the 
naturally occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/3489 1 .) 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are 
then ligated with T4 DNA ligase. E. coli HBI01 or XL-1 Blue cells are then transformed 
and bacteria are identified that contain the fragment inserted into plasmid pC6 using, for 
instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for transfection. 
Five ng of the expression plasmid pC6 a pC4 is cotransfected with 0.5 ug of the plasmid 
pSVneo using Iipofectin (Feigner et al., supra). The plasmid pSV2-neo contains a 
dominant selectable marker, the neo gene from Tn5 encoding an enzyme that confers 
resistance to a group of antibiotics including G418. The cells are seeded in alpha minus 
MEM supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and 
seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus MEM 
supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml G418. After about 
10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml 
flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 

PCT/US99/25031 

476 

800 nM). Clones growing at the highest concentrations of methotrexate are then 
transferred to new 6-well plates containing even higher concentrations of methotrexate (1 
uM, 2 uM, 5 uM, 10 mM, 20 mM). The same procedure is repeated until clones are 
obtained which grow at a concentration of 100 - 200 uM. Expression of the desired gene 
product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase 
HPLC analysis. 



Example 9: Pp^n f"-^nt 

The polypeptides of the present invention are preferably fused to other proteins. 
10 These fusion proteins can be used for a variety of applications. For example, fusion of the 
present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose binding 
protein facilitates purification. (See Example 5; see also EP A 394,827; Traunecker, et al., 
Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and albumin increases the 
halflife time in vivo. Nuclear localization signals fused to the polypeptides of the present 
15 invention can target the protein to a specific subcellular localization, while covalent 
heterodimer or homodimers can increase or decrease the activity of a fusion protein. 
Fusion proteins can also create chimeric molecules having more than one function. 
Finally, fusion proteins can increase solubility and/or stability of the fused protein 
compared to the non-fused protein. All of the types of fusion proteins described above can 
20 be made by modifying the following protocol, which outlines the fusion of a polypeptide to 
an IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using 
primers that span the 5' and 3' ends of the sequence described below. These primers also 
should have convenient restriction enzyme sites that will facilitate cloning into an 
25 expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can be 
ligated into the BamHI cloning site. Note that the 3' BamHI site should be destroyed. 
Next, the vector containing the human Fc portion is re-restricted with BamHI, linearizing 
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the vector, and a polynucleotide of the present invention, isolated by the PCR protocol 
described in Example 1, is ligated into this BamHI site. Note that the polynucleotide is 
cloned without a stop codon, otherwise a fusion protein will not be produced. 

If the naturally occurring signal sequence is used to produce the secreted protein, 
P C4 does not need a second signal peptide. Alternatively, if the naturally occurring signal 
sequence is not used, the vector can be modified to include a heterologous signal sequence. 
(See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCCCA 

GCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACCCAA 

GGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGTGGACG 

TAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGA 

GGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTAC 

CGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGA 

GTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACCCCCATCGAGAAAACCA 

TCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCA 

TCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGG 

CTTCTATCCAAGCGACATCGGCGTGGAGTGGGAGAGCAATGGGCAGCCGGAG 

AACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTC 

TACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCT 

CATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTC 

TCCCTGTCTCCGGGTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID 
NO:l) 

Emmpk 10? Production of an AntihoHv fm m a p n i V p B pt i(l< > 

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) As one example of such methods, cells expressing a 
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polypeptide of the present invention is administered to an animal to induce the production 
of sera containing polyclonal antibodies. In a preferred method, a preparation of the 
secreted protein is prepared and purified to render it substantially free of natural 
contaminants. Such a preparation is then introduced into an animal in order to produce 
polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 256:495 

(1975) ; Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et al., Eur. J. Immunol. 6:292 

(1976) ; Hammerling et al., in: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, 
N.Y., pp. 563-681 (1981).) In general, such procedures involve immunizing an animal 
(preferably a mouse) with polypeptide or, more preferably, with a secreted polypeptide- 
expressing cell. Such cells may be cultured in any suitable tissue culture medium; 
however, it is preferable to culture cells in Earle's modified Eagle's medium supplemented 
with 10% fetal bovine serum (inactivated at about 56 degrees C), and supplemented with 
about 10 g/1 of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100 
ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma cell 
line. Any suitable myeloma cell line may be employed in accordance with the present 
invention; however, it is preferable to employ the parent myeloma cell line (SP20), 
available from the ATCC. After fusion, the resulting hybridoma cells are selectively 
maintained in HAT medium, and then cloned by limiting dilution as described by Wands et 
al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells obtained through such a 
selection are then assayed to identify clones which secrete antibodies capable of binding 
the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can be 
produced in a two-step procedure using anti-idiotypic antibodies. Such a method makes 
use of the fact that antibodies are themselves antigens, and therefore, it is possible to obtain 
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an antibody which binds to a second antibody. In accordance with this method, protein 
specific antibodies are used to immunize an animal, preferably a mouse. The splenocytes 
of such an animal are then used to produce hybridoma cells, and the hybridoma cells are 
screened to identify clones which produce an antibody whose ability to bind to the protein- 
specific antibody can be blocked by the polypeptide. Such antibodies comprise anti- 
idiotypic antibodies to the protein-specific antibody and can be used to immunize an 
animal to induce formation of further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies of 
the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use "humanized" 
chimeric monoclonal antibodies. Such antibodies can be produced using genetic 
constructs derived from hybridoma cells producing the monoclonal antibodies described 
above. Methods for producing chimeric antibodies are known in the art. (See, for review, 
Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 (1986); Cabilly et al., 
U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP 173494; 
Neuberger et al., WO 8601533; Robinson et al., WO 8702671 ; Boulianne et al., Nature 
312:643 (1984); Neuberger et al., Nature 314:268 (1985).) 

Example 11: Production Of Secreted Prntrin Fnr H igh-Thronp hput Stt™nrc 
Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in Examples 
13-20. 
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First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(ltng/ml in PBS) 1 :20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a 
working solution of 50ug/ml. Add 200 ul of this solution to each well (24 well plates) and 
incubate at RT for 20 minutes. Be sure to distribute the solution over each well (note: a 
12-channel pipetter may be used with tips on every other channel). Aspirate off the Poly- 
D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered Saline). The PBS should 
remain in the well until just prior to plating the cells and plates may be poly-lysine coated 
in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10 s cells/well in .5ml 
DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine (12- 
604F Biowhittaker))/10% heat inactivated FBS(14-503F Biowhittaker)/! x Penstrep( 17- 
602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. With 
a small volume multi-channel pipetter, aliquot approximately 2ug of an expression vector 
containing a polynucleotide insert, produced by the methods described in Examples 8 or 9, 
into an appropriately labeled 96-well round bottom plate. With a multi-channel pipetter, 
add 50ul of the Lipofectamine/Optimem I mixture to each well. Pipette up and down 
gently to mix. Incubate at RT 15-45 minutes. After about 20 minutes, use a multi-channel 
pipetter to add 150ul Optimem I to each well. As a control, one plate of vector DNA 
lacking an insert should be transfected with each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too much 
time on PBS. First, person A aspirates off the media from four 24-well plates of cells, and 
then person B rinses each well with .5-lml PBS. Person A then aspirates off PBS rinse, 
and person B, using al2-channel pipetter with tips on every other channel, adds the 200ul 
of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to the even wells, 
to each row on the 24-well plates. Incubate at 37 degrees C for 6 hours. 
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While cells are incubating, prepare appropriate media, either 1%BSA in DMEM 
with lx penstrep, or CHO-5 media (1 16.6 mg/L of CaC12 (anhyd); 0.00130 mg/L CuS0 4 - 
5H 2 0; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of FeS0 4 -7H 2 0; 31 1.80 mg/L of Kcl; 
28.64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of NaCl; 2400.0 mg/L of 
5 NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 71.02 mg/L of Na,HP04; .4320 mg/L of ZnS0 4 - 
7H 2 0; .002 mg/L of Arachidonic Acid ; 1.022 mg/L of Cholesterol; .070 mg/L of DL- 
alpha-TocopheroI-Acetate; 0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 
0.010 mg/L of Myristic Acid; 0.010 mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid; 
0.010 mg/L of Palmitic Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid; 

10 2.20 mg/L of Tween 80; 4551 mg/L of D-Glucose; 130.85 mg/ml of L- Alanine; 147.50 
mg/ml of L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H 2 0; 6.65 mg/ml of L-Aspartic 
Acid; 29.56 mg/ml of L-Cystine-2HCL-H 2 0; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml 
of L-Glutamic Acid; 365.0 mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml 
of L-Histidine-HCL-H 2 0; 106.97 mg/ml of L-Isoleucine; 1 1 1.45 mg/ml of L-Leucine; 

15 163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L- 

Phenylalainine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L- 
Threonine; 19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H 2 0; 99.65 
mg/ml of L-Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L 
of Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of 

20 Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.3 19 mg/L 
of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 0.680 mg/L 
of Vitamin B 12 ; 25 mM of HEPES Buffer; 2.39 mg/L of Na Hypoxanthine; 0.105 mg/L of 
Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 55.0 mg/L of Sodium Pyruvate; 
0.0067 mg/L of Sodium Selenite; 20uM of Ethanolamine; 0.122 mg/L of Ferric Citrate; 

25 41 .70 mg/L of Methyl-B-Cyclodextrin complexed with Linoleic Acid; 33.33 mg/L of 
Methyl-B-Cyclodextrin complexed with Oleic Acid; and 10 mg/L of Methyl-B- 
Cyclodextrin complexed with Retinal) with 2mm glutamine and lx penstrep. (BSA (81- 
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068-3 Bayer) lOOgm dissolved in 1L DMEM for a 10% BSA stock solution). Filter the 
media and collect 50 ul for endotoxin assay in 15ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of the 
incubation period. Person A aspirates off the transfection media, while person B adds 
1.5ml appropriate media to each well. Incubate at 37 degrees C for 45 or 72 hours 
depending on the media used: 1 %BS A for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300u! multichannel pipetter, aliquot 600ul in one 1 ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supernatants from each 
well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further provides 
a method of identifying the protein in the supernatant characterized by an activity in a 
particular assay. 

Example 12: Construction of CAS Rp pnr« » r Crnistmrf 

One signal transduction pathway involved in the differentiation and proliferation of 
cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs pathway 
,bind to gamma activation site "GAS" elements or interferon-sensitive responsive element 
("ISRE"), located in the promoter of many genes. The binding of a protein to these 
elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six members 
of the STATs family. Stall and Stat3 are present in many cell types, as is Stat2 (as 
response to IFN-alpha is widespread). Stat4 is more restricted and is not in many cell 
types though it has been found in T helper class I, cells after treatment with IL-12. StatS 
was originally called mammary growth factor, but has been found at higher concentrations 
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in other cells including myeloid cells. It can be activated in tissue culture cells by many 
cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") family. 
Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, Jakl, Jak2, 
and Jak3. These kinases display significant sequence similarity and are generally 
catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table below. 
(Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 (1995).) A 
cytokine receptor family, capable of activating Jaks, is divided into two groups: (a) Class 1 
includes receptors for IL-2, IL-3. IL-4, IL-6, IL-7, 1L-9, IL-1 1, IL-12, IL-15, Epo, PRL, 
GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) Class 2 includes IFN-a, 
IFN-g, and 1L-10. The Class 1 receptors share a conserved cysteine motif (a set of four 
conserved cysteines and one tryptophan) and a WSXWS motif (a membrane proximal 
region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in tum activate 
STATs, which then translocate and bind to GAS elements. This entire process is 
encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of the 
GAS or the ISRE element, can be used to indicate proteins involved in the proliferation 
and differentiation of cells. For example, growth factors and cytokines are known to 
activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS elements 
linked to reporter molecules, activators of the Jaks-STATs pathway can be identified. 
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Ligand 

IFN family 
5 IFN-a/B 
IFN-g 
Il-IO 



10 



15 



20 



25 



30 



35 



mi 



gp!30 famj|v 

IL-6 (Pleiotrophic) + 

U-Il(Pleiotrophic) ? 

OnM(Pleiotrophic) ? 

UF(Pleiotrophic) ? 

CNTF(PIeiotrophic) -/+ 

G-CSF(Pleiotrophic) ? 

IL-12fPleiotrophic) + 

g-C family 
IL-2 (lymphocytes) 
IL-4 (lymph/myeloid) - 
IL-7 (lymphocytes) 
IL-9 (lymphocytes) 
IL-13 (lymphocyte) 
IL-15 ? 

CD 140 family 
IL-3 (myeloid) 
IL-5 (myeloid) 
GM-CSF (myeloid) 

Growth hn rmone fam j|y 
GH 7 
PRL 9 
EPO ? 

Receptor Tyrosine T^ina^c 
EGF 9 
PDGF 9 
CSF-1 9 



JAKs 
Mi 



+ 

+ 

9 



+ 
+ 
+ 
+ 
+ 
+ 



+ 
+ 
+ 
+ 
+ 
+ 
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+/- 



+ 
+ 
+ 



+ 

9 



+ 
? 

+ 

+ 

+ 
9 



+ 
+ 

+ 



+ 
+ 
+ 



+ 
+ 
+ 



+ 

+ 

+ 

+ 
9 



SEATS GAS(elem* m c) n f fgpp 



1.2,3 
1 

1.3 



1.3 
1.3 
1.3 
1.3 
1,3 
1.3 



1. 



1.3.5 

6 

5 

5 

6 

5 



5 

1,3.5 
5 



1,3 
1,3 
1.3 



ISRE 

GAS (IRFl>Lys6>IFP) 



GAS (IRFl>Lys6>IFP) 



GAS 

GAS (1RFI =IFP»Lv6)fleH) 

GAS " w 

GAS 

GAS 

GAS 



GAS (IRFl>lFP»Ly6) 

GAS 

GAS 



GAS(B-CAS>IRFI =IFP»Ly6) 



GAS (IRF1) 
GAS (not IRF1) 
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To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies of 
the GAS binding site found in the IRF1 promoter and previously demonstrated to bind 
STATs upon induction with a range of cytokines (Rothman et al., Immunity 1:457-468 
(1994).), although other GAS or ISRE elements can be used instead. The 5' primer also 
contains 18bp of sequence complementary to the SV40 early promoter sequence and is 
flanked with an Xhol site. The sequence of the 5' primer is: 

5VGCGCCTCGAGATITCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAA 
ATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3) 

The downstream primer is complementary to the SV40 promoter and is flanked 
with a Hind III site: 5^GCGGC A AGCTTTTTGC AAAGCCTAGGC:3 ' (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in the 
B-ga!:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested 
with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing with forward 
and reverse primers confirms that the insert contains the following sequence: 
5^CjmA£ATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAATGA 
TTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTA 
ACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCAT 
GGCTGACTAAi 1 1 1 ri l l ATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAG 

CTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTITTGCAAAAA 
eCH:3' (SEQIDNO:5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can be 
used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
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alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-S V40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindin and 
Xhol, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
neomycin resistance gene, and therefore, is not preferred for mammalian expression 
...systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS-SEAP 
reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using Sail and 
Notl, and inserted into a backbone vector containing the neomycin resistance gene, such as 
pGFP-1 (Clontech), using these restriction sites in the multiple cloning site, to create the 
GAS-SEAP/Neo vector. Once this vector is transfected into mammalian cells, this vector 
can then be used as a reporter molecule for GAS binding as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS with 
a different promoter sequence. For example, construction of reporter molecules containing 
NFK-B and EGR promoter sequences are described in Examples 15 and 16. However, 
many other promoters can be substituted using the protocols described in these Examples. 
For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be substituted, alone or in 
.combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, 11-2/NFAT, or NF-KB/GAS). 
Similarly, other cell lines can be used to test reporter construct activity, such as HELA 
(epithelial), HUVEC (endothelial), Reh (B-cell), Saos-2 (osteoblast), HUVAC (aortic), or 
Cardiomyocyte. 

Example 13; H» p h-Thro„ P hn„t Screening A**» y f n-T.~|. A ftilify 

The following protocol is used to assess T-cell activity by identifying factors, and 
determining whether supemate containing a polypeptide of the invention proliferates 
and/or differentiates T-cells. T-cell activity is assessed using the GAS/SEAP/Neo 
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construct produced in Example 12. Thus, factors that increase SEAP activity indicate the 
ability to activate the Jaks-STATS signal transduction pathway. The T-cell used in this 
assay is Jurkat T-cells (ATCC Accession No. TIB-152), although Molt-3 cells (ATCC 
Accession No. CRL-1552) and Molt-4 cells (ATCC Accession No. CRL-1582) cells can 
also be used. 

Jurkat T-cells are lymphoblastic CD4+ ThI helper cells. In order to generate stable 
cell lines, approximately 2 million Jurkat cells are transfected with the GAS-SEAP/neo 
vector using DMRIE-C (Life Technologies)(transfection procedure described below). The 
transfected cells are seeded to a density of approximately 20,000 cells per well and 
transfectants resistant to 1 mg/ml genticin selected. Resistant colonies are expanded and 
then tested for their response to increasing concentrations of interferon gamma. The dose 
response of a selected clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to generate 
sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI + 10% 
serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life Technologies) with 10 
ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul of DMRIE-C 
and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10 7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10 7 celis/ml. Then add I ml of 1 x 10 7 cells in OPTI-MEM to T25 flask 
and incubate at 37 degrees C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% 
serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% serum, 
1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with supernatants containing 
polypeptides of the invention and/or induced polypeptides of the invention as produced by 
the protocol described in Example 11. 
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On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The exact 
number of cells required will depend on the number of supematants being screened. For 
one 96 well plate, approximately 10 million cells (for 10 plates, 100 million cells) are 
required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into a 
96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul of 
cells into each well (therefore adding 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supematants are transferred 
directly from the 96 well plate containing the supematants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1; 1.0, 10 ng) is 
added to wells H9, H10, and HI 1 to serve as additional positive controls for the assay. 

The 96 well dishes containing Jurkat cells treated with supematants are placed in an 
incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples from 
each well are then transferred to an opaque 96 well plate using a 12 channel pipette. The 
opaque plates should be covered (using sellophene covers) and stored at -20 degrees C 
until SEAP assays are performed according to Example 17. The plates containing the 
remaining treated cells are placed at 4 degrees C and serve as a source of material for 
repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is known to 
activate Jurkat T cells. Over 30 fold induction is typically observed in the positive control 
wells. 

The above protocol may be used in the generation of both transient, as well as, 
stable transfected cells, which would be apparent to those of skill in the art. 

Example 14: Hjgh-TJironphniit Screening As s ay Identify inp Myeloid Ar tivity 

The following protocol is used to assess myeloid activity by determining whether 
polypeptides of the invention proliferates and/or differentiates myeloid cells. Myeloid cell 
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activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. Thus, 
factors that increase SEAP activity indicate the ability to activate the Jaks-STATS signal 
transduction pathway. The myeloid cell used in this assay is U937, a pre-monocyte cell 
line, although TF-1, HL60, or KG1 can be used. 
5 To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced in 

Example 12, a DEAE-Dextran method (Kharbanda et. al., 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2xlO e 7 U937 cells and wash with PBS. 
The U937 cells are usually grown in RPMI 1640 medium containing 10% heat-inactivated 
fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 mg/ml 
10 streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 0.5 
mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCI, 5 mM KCI, 375 
uM Na 2 HP04.7H 2 0, 1 mM MgCI 2 , and 675 uM CaCl 2 . Incubate at 37 degrees C for 45 
min. 

15 Wash the cells with RPMI 1640 medium containing 10% FBS and then resuspend 

in 10 ml complete medium and incubate at 37 degrees C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 ug/ml 
G418. The G418-free medium is used for routine growth but every one to two months, the 
cells should be re-grown in 400 ug/ml G418 for couple of passages. 
20 These cells are tested by harvesting lxlO 8 cells (this is enough for ten 96-well 

plates assay) and wash with PBS. Suspend the cells in 200 ml above described growth 
medium, with a final density of 5xl0 5 cells/ml. Plate 200 ul cells per well in the 96-well 
plate (or lxl 0 5 cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 1 1. 
25 Incubate at 37 degrees C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant according to 
the protocol described in Example 17. 
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Example 15; High-Throiiphput Screening Assay Tdentifvinp N »,. r o na l Activity 

When cells undergo differentiation and proliferation, a group of genes are activated 
through many different signal transduction pathways. One of these genes, EGR1 (early 
growth response gene 1), is induced in various tissues and cell types upon activation. The 
promoter ofEGRl is responsible for such induction. Using the EGR1 promoter linked to 
reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC12 cell 
lines. PC12 cells (rat phenochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl phorbol 
acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The EGR1 gene 
expression is activated during this treatment. Thus, by stably transfecting PC12 cells with 
a construct containing an EGR promoter linked to SEAP reporter, activation of PC12 cells 
can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. 
The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al., Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using the following primers: 
5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG-3' (SEQIDNO:6) 
5' GCGAAGCTTCGCGAGTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 
Using the GAS:SEAP/Neo vector produced in Example 12, EGRI amplified 
product can then be inserted into this vector. Linearize the GAS.SEAP/Neo vector using 
restriction enzymes Xhol/Hindm, removing the GAS/S V40 stuffer. Restrict the EGRI 
amplified product with these same enzymes. Ligate the vector and the EGRI promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and allowed 
to air dry for 2 hr. 
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PC12 cells arc routinely grown in RPMI-1640 medium (Bio Whittaker) containing 
10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat-inactivated fetal 
bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 ug/ml 
streptomycin on a precoated 10 cm tissue culture dish. One to four split is done every 
three to four days. Cells are removed from the plates by scraping and resuspended with 
pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by growing 
the cells in 300 ug/ml G418. The G418-free medium is used for routine growth but every 
one to two months, the cells should be re-grown in 300 ug/ml G418 for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% confluent 
is screened by removing the old medium. Wash the cells once with PBS (Phosphate 
buffered saline). Then starve the cells in low serum medium (RPMI-1640 containing 1% 
horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape off 
the cells from the plate, suspend the cells well in 2 ml low serum medium. Count the cell 
number and add more low serum medium to reach final cell density as 5x 10 5 cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to lxlO 5 
cells/well). Add 50 ul supernatant produced by Example 11, 37°C for 48 to 72 hr. As a 
positive control, a growth factor known to activate PC12 cells through EGR can be used, 
such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold induction of SEAP is 
typically seen in the positive control wells. SEAP assay the supernatant according to 
Example 17. 

Example 16: High-Throughput Screenin g Assay far T-cell Activity 

NF-KB (Nuclear Factor KB) is a transcription factor activated by a wide variety of 
agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40, 
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by 
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sealer and incubate at 65 degree C for 30 mm. Separate the Optiplates to avoid uneven 
heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 ml Assay Buffer and incubate at room temperature 5 
5 min. Empty the dispenser and prime with the Reaction Buffer (see the table below). Add 
50 ul Reaction Buffer and incubate at room temperature for 20 minutes. Since the 
intensity of the chemiluminescent signal is time dependent, and it takes about 10 minutes 
to read 5 plates on luminometer, one should treat 5 plates at each time and start the second 
set 10 minutes later. 

Read the relative light unit in the luminometer. Set H12 as blank, and print the 
results. An increase in chemiluminescence indicates reporter activity. 

Reaction Buffer Formulation: 



# of plates 
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11 

12 

13 

14 
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16 

17 
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23 

24 

25 
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33 
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35 
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Rxn buffer diluent (ml) 
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60 
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65 


3.25 


70 
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75 


3.75 


80 
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85 


4.25 


90 


4.5 


95 


4.75 


100 
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105 


5.25 


110 


5.5 


115 


5.75 


120 
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125 


6.25 


130 


6.5 


135 


6.75 


140 
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145 


7.25 


150 


7.5 


155 


7.75 


160 
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165 


8.25 


170 


8.5 


175 


8.75 


180 
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185 


9.25 


190 


9.5 


195 


9.75 


200 


10 


205 


10.25 
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40 210 io.5 

41 215 10.75 

42 220 a 

43 225 u.25 

44 230 n.5 

45 235 11.75 

46 240 12 

47 245 12.25 

48 250 12^5 

49 255 12.75 
50 260 13 



Example 18: High-Throuphpnt Screening Assay Identifying Chang es in Small 
Molecule C oncentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
5 molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supematants which 
bind to receptors of a particular cell. Although the following protocol describes an assay 
for calcium, this protocol can easily be modified to detect changes in potassium, sodium, 
pH, membrane potential, or any other small molecule which is detectable by a fluorescent 
10 probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small molecules. 
Clearly, any fluorescent molecule detecting a small molecule can be used instead of the 
calcium fluorescent molecule, fluo-4 (Molecular Probes, Inc.; catalog no. F-14202), used 
15 here. 

For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star black 96- 
well plate with clear bottom. The plate is incubated in a C0 2 incubator for 20 hours. The 
adherent cells are washed two times in Biotek washer with 200 ul of HBSS (Hank's 
Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 
20 A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To load 

the cells with fluo-4 , 50 ul of 12 ug/ml fluo-4 is added to each well. The plate is 
incubated at 37 degrees C in a C0 2 incubator for 60 min. The plate is washed four times in 
the Biotek washer with HBSS leaving 100 ul of buffer. 

SUBSTITUTE SHEET (RULE 26) 



THIS PAGE BLANK (uspto) 



5 



0 



WO 00/29435 

PCT/US99/25031 

496 

For non-adherent cells, the cells are spun down from culture media. Ceils are re- 
suspended to 2-5x10' cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml fluo-4 
solution in 10% pluronic acid DMSO is added to each ml of cell suspension. The tube is 
then placed in a 37 degrees C water bath for 30-60 min. The cells are washed twice with 
HBSS, resuspended to 1x10* cells/ml, and dispensed into a microplate, 100 ul/well. The 
plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once in Denley 
CellWash with 200 ul, followed by an aspiration step to 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as fluo- 
4 . The supernatant is added to the well, and a change in fluorescence is detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 second; 
(3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and (6) Sample 
addition is 50 ul. Increased emission at 530 nm indicates an extracellular signaling event 
which has resulted in an increase in the intracellular Ca++ concentration. 

Example 19: Hirn-Thnwnnut Screen^ i^,,^ Kingw ^ ity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of transmembrane 
and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase RPTK) group are 
receptors for a range of mitogenic and metabolic growth factors including the PDGF, FGF, 
EGF, NGF, HGF and Insulin receptor subfamilies. In addition there are a large family of 
RPTKs for which the corresponding ligand is unknown. Ligands for RPTKs include 
mainly secreted small proteins, but also membrane-bound and extracellular matrix 
proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 
resulting in ^phosphorylation of the receptor subunits and activation of the cytoplasmic 
tyrosine kinases. The cytoplasmic tyrosine kinases include receptor associated tyrosine 
kinases of the src-family (e.g., src, yes, Ick, lyn, fyn) and non-receptor linked and cytosolic 
protein tyrosine kinases, such as the Jak family, members of which mediate signal 
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transduction triggered by the cytokine superfamily of receptors (e.g., the Interleukins, 
Interferons, GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine kinase 
activity, the identification of novel human secreted proteins capable of activating tyrosine 
kinase signal transduction pathways are of interest. Therefore, the following protocol is 
designed to identify those novel human secreted proteins capable of activating the tyrosine 
kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 25,000 
cells per well in a 96 well Loprodyne Silent Screen Plates purchased from Nalge Nunc 
(Naperville, IL). The plates are sterilized with two 30 minute rinses with 100% ethanol, 
rinsed with water and dried overnight. Some plates are coated for 2 hr with 100 ml of cell 
culture grade type 1 collagen (50 mg/ml), gelatin (2%) or polylysine (50 mg/ml), all of 
which can be purchased from Sigma Chemicals (St. Louis, MO) or 10% Matrigel 
purchased from Becton Dickinson (Bedford,MA), or calf serum, rinsed with PBS and 
stored at 4 degree C. Cell growth on these plates is assayed by seeding 5,000 cells/well in 
growth medium and indirect quantitation of cell number through use of alamarBlue as 
described by the manufacturer Alamar Biosciences, Inc. (Sacramento, CA) after 48 hr. 
Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are used to cover the 
Loprodyne Silent Screen Plates. Falcon Microtest III cell culture plates can also be used in 
some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of Loprodyne 
plates (20,000/200ml/well) and cultured overnight in complete medium. Cells are 
quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 minutes 
treatment with EOF (60ng/ml) or 50 ul of the supernatant produced in Example 1 1, the 
medium was removed and 100 ml of extraction buffer ((20 mM HEPES pH 7.5, 0.15 M 
NaCI, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM Na4P207 and a cocktail of 
protease inhibitors (# 1836170) obtained from Boeheringer Mannheim (Indianapolis, IN) is 
added to each well and the plate is shaken on a rotating shaker for 5 minutes at 4 degrees 
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C. The plate is then placed in a vacuum transfer manifold and the extract filtered through 
the 0.45 mm membrane bottoms of each well using house vacuum. Extracts are collected 
in a 96-well catch/assay plate in the bottom of the vacuum manifold and immediately 
placed on ice. To obtain extracts clarified by centrifugation, the content of each well, after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 4 
degrees C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by determining 
its ability to phosphorylate a tyrosine residue on a specific substrate (a biotinylated 
peptide). Biotinylated peptides that can be used for this purpose include PSK1 
(corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and PSK2 
(corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for a range of 
tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in order. 
First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg 2+ (5mM ATP/50mM 
MgCI 2 ), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, pH7.3, 40 mM 
beta-glycerophosphate, ImM EGTA, lOOmM MgCl 2 , 5 mM MnCI 2> 0.5 mg/ml BSA), 
then 5ul of Sodium Vanadate(l mM), and then 5ul of water. Mix the components gently 
and preincubate the reaction mix at 30 degrees C for 2 min. Initial the reaction by adding 
lOul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120mm 
EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 
mixture to a microliter plate (MTP) module and incubating at 37 degrees C for 20 min. 
This allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
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phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr-POD(0.5u/ml)) 
to each well and incubate at 37 degrees C for one hour. Wash the well as above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the absorbance 
5 of the sample at 405 nm by using ELISA reader. The level of bound peroxidase activity is 
quantitated using an ELISA reader and reflects the level of tyrosine kinase activity. 

Example 2Q; Hiph-Thx(mghPUt Screening Assay Irfontify fr p Phosp ho ry l ation Activity 
As a potential alternative and/or compliment to the assay of protein tyrosine kinase 

10 activity described in Example 19, an assay which detects activation (phosphorylation) of 
major intracellular signal transduction intermediates can also be used. For example, as 
described below one particular assay can detect tyrosine phosphorylation of the Erk- 1 and 
Erk-2 kinases. However, phosphorylation of other molecules, such as Raf, JNK, p38 
MAP, Map kinase kinase (MEK), MEK kinase, Src, Muscle specific kinase (MuSK), 

15 IRAK, Tec, and Janus, as well as any other phosphoserine, phosphotyrosine, or 

phosphothreonine molecule, can be detected by substituting these molecules for Erk-1 or 
Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELISA plate 
with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are then rinsed 

20 with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates are then 
treated with 2 commercial monoclonal antibodies (lOOng/well) against Erk-land Erk-2 (1 
hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this step can easily be 
modified by substituting a monoclonal antibody detecting any of the above described 
molecules.) After 3-5 rinses with PBS, the plates are stored at 4 degrees C until use. 

25 A43 1 cells are seeded at 20,000/weIl in a 96-well Loprodyne filterplate and 

cultured overnight in growth medium. The cells are then starved for 48 hr in basal medium 
(DMEM) and then treated with EGF (6ng/well) or 50 ul of the supematants obtained in 
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Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts filtered directly 
into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinase (lOng/well) is used in place of 
A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and Erk-2 
kinases (lhr at RT). This antibody is biotinylated by standard procedures. The bound 
polyclonal antibody is then quantitated by successive incubations with Europium- 
streptavidin and Europium fluorescence enhancing reagent in the Wallac DELFIA 
instrument (time-resolved fluorescence). An increased fluorescent signal over background 
indicates a phosphorylation. 

Example 21 ; Method of Determining Alt«r,rtnp < , in a r rfnP (W SDO ndinp in » 
Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated from these 
RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is then used 
as a template for PCR, employing primers surrounding regions of interest in SEQ ID 
NO:X. Suggested PCR conditions consist of 35 cycles at 95 degrees C for 30 seconds; 60- 
120 seconds at 52-58 degrees C; and 60-120 seconds at 70 degrees C, using buffer 
solutions described in Sidransky et al„ Science 252:706 (1991). 

PCR products are then sequenced using primers labeled at their 5' end with T4 
polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre Technologies). 
The intron-exon borders of selected exons is also determined and genomic PCR products 
analyzed to confirm the results. PCR products harboring suspected mutations is then 
cloned and sequenced to validate the results of the direct sequencing. 

PCR products is cloned into T-tailed vectors as described in Holton et al., Nucleic 
Acids Research, 19: 1 156 (1991) and sequenced with T7 polymerase (United States 
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Biochemical). Affected individuals are identified by mutations not present in unaffected 
individuals. 

Genomic rearrangements are also observed as a method of determining alterations 
in a gene corresponding to a polynucleotide. Genomic clones isolated according to 
5 Example 2 are nick-translated with digoxigenindeoxy-uridine 5'-triphosphate (Boehringer 
Manheim), and FISH performed as described in Johnson et al., Methods Cell Biol. 35:73- 
99 (1991). Hybridization with the labeled probe is carried out using a vast excess of 
human cot-1 DNA for specific hybridization to the corresponding genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenylidole and propidium 

10 iodide, producing a combination of C- and R-bands. Aligned images for precise mapping 
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in 
combination with a cooled charge-coupled device camera (Photometries, Tucson, AZ) and 
variable excitation wavelength filters. (Johnson et al., Genet. Anal. Tech. Appl., 8:75 
(1991).) Image collection, analysis and chromosomal fractional length measurements are 

1 5 performed using the ISee Graphical Program System. (Inovision Corporation, Durham, 
NC.) Chromosome alterations of the genomic region hybridized by the probe are 
identified as insertions, deletions, and translocations. These alterations are used as a 
diagnostic marker for an associated disease. 

20 Example 22: Method of Detecting Abnorma l Levels of a Po lypeptide in a Riolnpi™! 
Sample 

A polypeptide of the present invention can be detected in a biological sample, and 
if an increased or decreased level of the polypeptide is detected, this polypeptide is a 
marker for a particular phenotype. Methods of detection are numerous, and thus, it is 
25 understood that one skilled in the art can modify the following assay to fit their particular 
needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microliter plate are coated with specific 
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antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are either 
monoclonal or polyclonal and are produced by the method described in Example 10. The 
wells are blocked so that non-specific binding of the polypeptide to the well is reduced. 
The coated wells are then incubated for > 2 hours at RT with a sample containing 
5 the polypeptide. Preferably, serial dilutions of the sample should be used to validate 

results. The plates are then washed three times with deionized or distilled water to remove 
unbounded polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a concentration 
of 25-400 ng, is added and incubated for 2 hours at room temperature. The plates are again 
10 washed three times with deionized or distilled water to remove unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl phosphate 
(NPP) substrate solution to each well and incubate 1 hour at room temperature. Measure 
the reaction by a microliter plate reader. Prepare a standard curve, using serial dilutions of 
a control sample, and plot polypeptide concentration on the X-axis (log scale) and 
15 fluorescence or absorbance of the Y-axis (linear scale). Interpolate the concentration of 
the polypeptide in the sample using the standard curve. 

Example 23: Fnri^.^i^ 

The invention also provides methods of treatment and/or prevention of diseases or 
20 disorders (such as, for example, any one or more of the diseases or disorders disclosed 
herein) by administration to a subject of an effective amount of a Therapeutic. By 
therapeutic is meant a polynucleotides or polypeptides of the invention (including 
fragments and variants), agonists or antagonists thereof, and/or antibodies thereto, in 
combination with a pharmaceutically acceptable carrier type (e.g., a sterile carrier). 
25 The Therapeutic will be formulated and dosed in a fashion consistent with good 

medical practice, taking into account the clinical condition of the individual patient 
(especially the side effects of treatment with the Therapeutic alone), the site of delivery, 
the method of administration, the scheduling of administration, and other factors known to 
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practitioners. The "effective amount" for purposes herein is thus determined by such 
considerations. 

As a general proposition, the total pharmaceutical^ effective amount of the 
Therapeutic administered parenterally per dose will be in the range of about lug/kg/day to 
10 mg/kg/day of patient body weight, although, as noted above, this will be subject to 
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most 
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given 
continuously, the Therapeutic is typically administered at a dose rate of about 1 ug/kg/hour 
to about 50 ug/kg/hour, either by 1-4 injections per day or by continuous subcutaneous 
infusions, for example, using a mini-pump. An intravenous bag solution may also be 
employed. The length of treatment needed to observe changes and the interval following 
treatment for responses to occur appears to vary depending on the desired effect. 

Therapeutics can be are administered orally, rectally, parenterally, intracistemally, 
intravaginally, intraperitoneal^, topically (as by powders, ointments, gels, drops or 
transdermal patch), bucally, or as an oral or nasal spray. Tharmaceutically acceptable 
carrier" refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material 
or formulation auxiliary of any. The term "parenteral" as used herein refers to modes of 
administration which include intravenous, intramuscular, intraperitoneal, intrasternal, 
subcutaneous and intraarticular injection and infusion. 

Therapeutics of the invention are also suitably administered by sustained-release 
systems. Suitable examples of sustained-release Therapeutics are administered orally, 
rectally, parenterally, intracistemally, intravaginally, intraperitoneally, topically (as by 
powders, ointments, gels, drops or transdermal patch), bucally, or as an oral or nasal spray. 
"Pharmaceutically acceptable carrier" refers to a non-toxic solid, semisolid or liquid filler, 
diluent, encapsulating material or formulation auxiliary of any type. The term "parenteral" 
as used herein refers to modes of administration which include intravenous, intramuscular, 
intraperitoneal, intrasternal, subcutaneous and intraarticular injection and infusion. 
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Therapeutics of the invention are also suitably administered by sustained-release 
systems. Suitable examples of sustained-release Therapeutics include suitable polymeric 
materials (such as, for example, semi-permeable polymer matrices in the form of shaped 
articles, e.g., films, or mirocapsules), suitable hydrophobic materials (for example as an 
5 emulsion in an acceptable oil) or ion exchange resins, and sparingly soluble derivatives 
(such as, for example, a sparingly soluble salt). 

Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 
58,481), copolymer, of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman et al., 
Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl methacrylate) (Langer et al., J. 
10 Biomed. Mater. Res. 15:167-277 (1981), and Langer, Chem. Tech. 12:98-105(1982)), 
ethylene vinyl acetate (Langer et al., Id.) or poly-D- (-Ki-hydroxybutyric acid (EP 
133,988). 

Sustained-release Therapeutics also include liposomally entrapped Therapeutics of 
the invention (see generally. Langer, Science 249:1527-1533 (1990); Treat et al., in 
15 Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler 
(eds.), Liss, New York, pp. 317 -327 and 353-365 (1989)). Liposomes containing the 
Therapeutic are prepared by methods known per se: DE 3,218,121; Epstein et al., Proc. 
Natl. Acad. Sci. (USA) 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci.(USA) 
77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; 
Japanese Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. 
Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar type in 
which the lipid content is greater than about 30 mol. percent cholesterol, the selected 
proportion being adjusted for the optimal Therapeutic. 

In yet an additional embodiment, the Therapeutics of the invention are delivered 
by way of a pump (see Langer, supra; Sefton, CRC Crit. Ref. Biomed. Eng. 14:201 (1987); 
Buchwald et al.. Surgery 88:507 (1980); Saudek et al., N. Engl. J. Med. 321:574 (1989)). 

Other controlled release systems are discussed in the review by Langer (Science 
249:1527-1533(1990)). 
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For parenteral administration, in one embodiment, the Therapeutic is formulated 
generally by mixing it at the desired degree of purity, in a unit dosage injectable form 
(solution, suspension, or emulsion), with a pharmaceutically acceptable carrier, i.e., one 
that is non-toxic to recipients at the dosages and concentrations employed and is 
compatible with other ingredients of the formulation. For example, the formulation 
preferably does not include oxidizing agents and other compounds that are known to be 
deleterious to the Therapeutic. 

Generally, the formulations are prepared by contacting the Therapeutic uniformly 
and intimately with liquid carriers or finely divided solid carriers or both. Then, if 
necessary, the product is shaped into the desired formulation. Preferably the carrier is a 
parenteral carrier, more preferably a solution that is isotonic with the blood of the 
recipient. Examples of such carrier vehicles include water, saline, Ringer's solution, and 
dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful 
herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, citrate, 
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic 
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine 
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, 
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates 
including cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or 
nonionic surfactants such as polysorbates, poloxamers, or PEG. 

The Therapeutic is typically formulated in such vehicles at a concentration of about 
0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be 
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understood that the use of certain of the foregoing excipients, carriers, or stabilizers will 
result in the formation of polypeptide salts. 

Any pharmaceutical used for therapeutic administration can be sterile. Sterility is 
readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutics generally are placed into a container having a sterile access 
port, for example, an intravenous solution bag or vial having a stopper pierceable by a 
hypodermic injection needle. 

Therapeutics ordinarily will be stored in unit or multi-dose containers, for example, 
sealed ampoules or vials, as an aqueous solution or as a lyophilized formulation for 
reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with 5 
ml of sterile-filtered 1% (w/v) aqueous Therapeutic solution, and the resulting mixture is 
lyophilized. The infusion solution is prepared by reconstituting the lyophilized 
Therapeutic using bacteriostatic Water-for-lnjection. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the Therapeutics of the invention. 
Associated with such containers) can be a notice in the form prescribed by a governmental 
agency regulating the manufacture, use or sale of pharmaceuticals or biological products, 
which notice reflects approval by the agency of manufacture, use or sale for human 
administration. In addition, the Therapeutics may be employed in conjunction with other 
therapeutic compounds. 

The Therapeutics of the invention may be administered alone or in combination 
with adjuvants. Adjuvants that may be administered with the Therapeutics of the invention 
include, but are not limited to, alum, alum plus deoxycholate (ImmunoAg), MTP-PE 
(Biocine Corp.), QS21 (Genentech, Inc.), BCG, and MPL. In a specific embodiment, 
Therapeutics of the invention are administered in combination with alum. In another 
specific embodiment, Therapeutics of the invention are administered in combination with 
QS-21. Further adjuvants that may be administered with the Therapeutics of the invention 
include, but are not limited to, Monophosphoryl lipid immunomodulator, AdjuVax 100a, 
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QS-21, QS-18, CRL1005, Aluminum salts, MF-59, and Virosomal adjuvant technology. 
Vaccines that may be administered with the Therapeutics of the invention include, but are 
not limited to, vaccines directed toward protection against MMR (measles, mumps, 
rubella), polio, varicella, tetanus/diptheria, hepatitis A, hepatitis B, haemophilus influenzae 
B, whooping cough, pneumonia, influenza, Lyme's Disease, rotavirus, cholera, yellow 
fever, Japanese encephalitis, poliomyelitis, rabies, typhoid fever, and pertussis. 
Combinations may be administered either concomitantly, e.g., as an admixture, separately 
but simultaneously or concurrently; or sequentially. This includes presentations in which 
the combined agents are administered together as a therapeutic mixture, and also 
procedures in which the combined agents are administered separately but simultaneously, 
e.g., as through separate intravenous lines into the same individual. Administration "in 
combination" further includes the separate administration of one of the compounds or 
agents given first, followed by the second. 

The Therapeutics of the invention may be administered alone or in combination 
with other therapeutic agents. Therapeutic agents that may be administered in combination 
with the Therapeutics of the invention, include but not limited to, other members of the 
TNF family, chemotherapeutic agents, antibiotics, steroidal and non-steroidal anti- 
inflammatories, conventional immunotherapeutic agents, cytokines and/or growth factors. 
Combinations may be administered either concomitantly, e.g., as an admixture, separately 
but simultaneously or concurrently; or sequentially. This includes presentations in which 
the combined agents are administered together as a therapeutic mixture, and also 
procedures in which the combined agents are administered separately but simultaneously, 
e.g., as through separate intravenous lines into the same individual. Administration "in 
combination" further includes the separate administration of one of the compounds or 
agents given first, followed by the second. 

In one embodiment, the Therapeutics of the invention are administered in combination 
with members of the TNF family. TNF, TNF-related or TNF-like molecules that may be 
administered with the Therapeutics of the invention include, but are not limited to, soluble 
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forms of TNF-alpha, lymphotoxin-alpha (LT-alpha, also known as TNF-beta), LT-beta 
(found in complex heterotrimer LT-alpha2-beta), OPGL, FasL, CD27L, CD30L, CD40L, 
4-1BBL, DcR3, OX40L, TNF-gamma (International Publication No. WO 96/14328), 
AIM-I (International Publication No. WO 97/33899), endokine-alpha (International 
Publication No. WO 98/07880), TR6 (International Publication No. WO 98/30694), OPG, 
and neutrokine-alpha (International Publication No. WO 98/18921, OX40, and nerve 
growth factor (NGF), and soluble forms of Fas, CD30, CD27, CD40 and 4-IBB, TR2 
(International Publication No. WO 96/34095), DR3 (International Publication No. WO 
97/33904), DR4 (International Publication No. WO 98/32856), TR5 (International 
Publication No. WO 98/30693), TR6 (International Publication No. WO 98/30694), TR7 
(International Publication No. WO 98/41629), TRANK, TR9 (International Publication 
No. WO 98/56892),TR10 (International Publication No. WO 98/54202), 312C2 
(International Publication No. WO 98/06842), and TR12, and soluble forms CD154, 
CD70, and CD153. 

In certain embodiments, Therapeutics of the invention are administered in 
combination with antiretroviral agents, nucleoside reverse transcriptase inhibitors, non- 
nucleoside reverse transcriptase inhibitors, and/or protease inhibitors. Nucleoside reverse 
transcriptase inhibitors that may be administered in combination with the Therapeutics of 
the invention, include, but are not limited to, RETROVIR™ (zidovudine/ AZT), VIDEX™ 
(didanosine/ddl), HIVID™ (zalcitabine/ddC), ZERIT™ (stavudine/d4T), EPIVIR™ 
(lamivudine/3TC), and COMBIVIR™ (zidovudine/lamivudine). Non-nucleoside reverse 
transcriptase inhibitors that may be administered in combination with the Therapeutics of 
the invention, include, but are not limited to, VIRAMUNE™ (nevirapine), 
RESCRIPTOR™ (delavirdine), and SUSTIVA™ (efavirenz). Protease inhibitors that may 
be administered in combination with the Therapeutics of the invention, include, but are not 
limited to, CRIXIVAN™ (indinavir), NORVIR™ (ritonavir), INVIRASE™ (saquinavir), 
and VIRACEPT™ (nelfinavir). In a specific embodiment, antiretroviral agents, 
nucleoside reverse transcriptase inhibitors, non-nucleoside reverse transcriptase inhibitors, 
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and/or protease inhibitors may be used in any combination with Therapeutics of the 
invention to treat AIDS and/or to prevent or treat HIV infection. 

In other embodiments, Therapeutics of the invention may be administered in 
combination with anti-opportunistic infection agents. Anti-opportunistic agents that may 
be administered in combination with the Therapeutics of the invention, include, but are not 
limited to, TRIMETHOPRIM-SULFAMETHOXAZOLE™, DAPSONE™, 
PENTAMIDINE™ , ATOVAQUONE™ , ISONIAZID™ , RIFAMPIN™, 
PYR AZINAMIDE™ , ETHAMBUTOL™, RIFABUTIN™, CLARITHROMYCIN™, 
AZITHROMYCIN™, GANCICLOVIR™, FOSCARNET™, CIDOFOVIR™, 
FLUCONAZOLE™, ITRACONAZOLE™, KETOCONAZOLE™, ACYCLOVIR™, 
FAMCICOLVIR™, PYRIMETHAMINE™, LEUCOVORIN™, NEUPOGEN™ 
(filgrastim/G-CSF), and LEUKINE™ (sargramostim/GM-CSF). In a specific 
embodiment, Therapeutics of the invention are used in any combination with 
TRIMETHOPRIM-SULFAMETHOXAZOLE™, DAPSONE™, PENTAMIDINE™, 
and/or ATOVAQUONE™ to prophylactically treat or prevent an opportunistic 
Pneumocystis carinii pneumonia infection. In another specific embodiment, Therapeutics 
of the invention are used in any combination with ISONIAZID™, RIFAMPIN™, 
PYR AZINAMIDE™ , and/or ETHAMBUTOL™ to prophylactically treat or prevent an 
opportunistic Mycobacterium avium complex infection. In another specific embodiment. 
Therapeutics of the invention are used in any combination with RIFABUTIN™, 
CLARITHROMYCIN™, and/or AZITHROMYCIN™ to prophylactically treat or prevent 
an opportunistic Mycobacterium tuberculosis infection. In another specific embodiment, 
Therapeutics of the invention are used in any combination with GANCICLOVIR™, 
FOSCARNET™, and/or CIDOFOVIR™ to prophylactically treat or prevent an 
opportunistic cytomegalovirus infection. In another specific embodiment, Therapeutics of 
the invention are used in any combination with FLUCONAZOLE™, 
ITRACONAZOLE™, and/or KETOCONAZOLE™ to prophylactically treat or prevent an 
opportunistic fungal infection. In another specific embodiment, Therapeutics of the 
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invention are used in any combination with ACYCLOVIR™ and/or FAMCICOLVIR™ t0 
prophylactically treat or prevent an opportunistic herpes simplex virus type I and/or type II 
infection. In another specific embodiment, Therapeutics of the invention are used in any 
combination with PYRIMETHAMINE™ and/or LEUCOVORIN™ to prophylactically 
5 treat or prevent an opportunistic Toxoplasma gondii infection. In another specific 
embodiment, Therapeutics of the invention are used in any combination with 
LEUCOVORIN and/or NEUPOGEN™ to prophylactically treat or prevent an 
opportunistic bacterial infection. 

In a further embodiment, the Therapeutics of the invention are administered in 
10 combination with an antiviral agent. Antiviral agents that may be administered with the 
Therapeutics of the invention include, but are not limited to, acyclovir, ribavirin, 
amantadine, and remantidine. 

In a further embodiment, the Therapeutics of the invention are administered in 
combination with an antibiotic agent. Antibiotic agents that may be administered with the 

15 Therapeutics of the invention include, but are not limited to, amoxicillin, beta-lactamases, 
aminoglycosides, beta-lactam (glycopeptide), beta-lactamases, Clindamycin, 
chloramphenicol, cephalosporins, ciprofloxacin, ciprofloxacin, erythromycin, 
fluoroquinolones, macrolides, metronidazole, penicillins, quinolones, rifampin, 
streptomycin, sulfonamide, tetracyclines, trimethoprim, trimethoprim-sulfamthoxazole, 

20 and vancomycin. 

Conventional nonspecific immunosuppressive agents, that may be administered in 
combination with the Therapeutics of the invention include, but are not limited to, 
steroids, cyclosporine, cyclosporine analogs, cyclophosphamide methylprednisone, 
prednisone, azathioprine, FK-506, 15-deoxyspergualin, and other immunosuppressive 
25 agents that act by suppressing the function of responding T cells. 

In specific embodiments. Therapeutics of the invention are administered in 
combination with immunosuppressants. Immunosuppressants preparations that may be 
administered with the Therapeutics of the invention include, but are not limited to, 
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ORTHOCLONE™ (OKT3), SANDIMMUNE™/NEORAL™/SANGDYA™ (cyclosporin), 
PROGRAF™ (tacrolimus), CELLCEPT™ (mycophenolate), Azathioprine, 
glucorticosteroids, and RAPAMUNE™ (sirolimus). In a specific embodiment, 
immunosuppressants may be used to prevent rejection of organ or bone marrow 
5 transplantation. 

In an additional embodiment, Therapeutics of the invention are administered alone 
or in combination with one or more intravenous immune globulin preparations. 
Intravenous immune globulin preparations that may be administered with the Therapeutics 
of the invention include, but not limited to, GAMMAR™, IVEEGAM™, 
10 SANDOGLOBULIN™, GAMMAGARD S/D™, and GAMIMUNE™. In a specific 
embodiment, Therapeutics of the invention are administered in combination with 
intravenous immune globulin preparations in transplantation therapy (e.g., bone marrow 
transplant). 

In an additional embodiment, the Therapeutics of the invention are administered 
15 alone or in combination with an anti -inflammatory agent. Anti-inflammatory agents that 
may be administered with the Therapeutics of the invention include, but are not limited to, 
glucocorticoids and the nonsteroidal antiinflammatories, aminoarylcarboxylic acid 
derivatives, arylacetic acid derivatives, arylbutyric acid derivatives, arylcarboxylic acids, 
arylpropionic acid derivatives, pyrazoles, pyrazolones, salicylic acid derivatives, 
20 thiazinecarboxamides, e-acetamidocaproic acid, S-adenosylmethionine, 3-amino-4- 
hydroxybutyric acid, amixetrine, bendazac, benzydamine, bucolome, difenpiramide, 
ditazol, emorfazone, guaiazulene, nabumetone, nimesulide, orgotein, oxaceprol, 
paranyline, perisoxal, pifoxime, proquazone, proxazole, and tenidap. 

In another embodiment, compostions of the invention are administered in 
25 combination with a chemotherapeutic agent. Chemotherapeutic agents that may be 
administered with the Therapeutics of the invention include, but are not limited to, 
antibiotic derivatives (e.g., doxorubicin, bleomycin, daunorubicin, and dactinomycin); 
antiestrogens (e.g., tamoxifen); antimetabolites (e.g., fluorouracil, 5-FU, methotrexate, 
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floxuridine, interferon alpha-2b, glutamic acid, plicamycin, mercaptopurine, and 6- 
thioguanine); cytotoxic agents (e.g., carmustine, BCNU, lomustine, CCNU, cytosine 
arabinoside, cyclophosphamide, estramustine, hydroxyurea, procarbazine, mitomycin, 
busulfan, cis-plattn, and vincristine sulfate); hormones (e.g., medroxyprogesterone, 
5 estramustine phosphate sodium, ethinyl estradiol, estradiol, megestrol acetate, 
methyltestosterone, diethylstilbestrol diphosphate, chlorotrianisene, and testolactone); 
nitrogen mustard derivatives (e.g., mephalen, chorambucil, mechlorethamine (nitrogen 
mustard) and thiotepa); steroids and combinations (e.g., bethamethasone sodium 
phosphate); and others (e.g., dicarbazine, asparaginase, mitotane, vincristine sulfate, 
10 vinblastine sulfate, and etoposide). 

In a specific embodiment, Therapeutics of the invention are administered in 
combination with CHOP (cyclophosphamide, doxorubicin, vincristine, and prednisone) or 
any combination of the components of CHOP. In another embodiment, Therapeutics of 
the invention are administered in combination with Rituximab. In a further embodiment, 
15 Therapeutics of the invention are administered with Rituxmab and CHOP, or Rituxmab 
and any combination of the components of CHOP. 

In an additional embodiment, the Therapeutics of the invention are administered in 
combination with cytokines. Cytokines that may be administered with the Therapeutics of 
the invention include, but are not limited to, IL2, IL3, IL4, IL5, IL6, IL7, IL10, IL12, 
20 IL13, IL15, anti-CD40, CD40L, IFN-gamma and TNF-alpha. In another embodiment, 
Therapeutics of the invention may be administered with any interleukin, including, but not 
limited to, IL-lalpha, IL-lbeta, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-IO, IL-1 1, 
IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL-20, and IL-21. 

In an additional embodiment, the Therapeutics of the invention are administered in 
25 combination with angiogenic proteins. Angiogenic proteins that may be administered with 
the Therapeutics of the invention include, but are not limited to, Glioma Derived Growth 
Factor (GDGF), as disclosed in European Patent Number EP-399816; Platelet Derived 
Growth Factor-A (PDGF-A), as disclosed in European Patent Number EP-6821 10; Platelet 
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Derived Growth Factor-B (PDGF-B), as disclosed in European Patent Number EP-282317; 
Placental Growth Factor (P1GF), as disclosed in International Publication Number WO 
92/06194; Placental Growth Factor-2 (P1GF-2), as disclosed in Hauser et al., Gorwth 
Factors, 4:259-268 (1993); Vascular Endothelial Growth Factor (VEGF), as disclosed in 
5 International Publication Number WO 90/13649; Vascular Endothelial Growth Factor-A 
(VEGF-A), as disclosed in European Patent Number EP-506477; Vascular Endothelial 
Growth Factor-2 (VEGF-2), as disclosed in International Publication Number WO 
96/39515; Vascular Endothelial Growth Factor B (VEGF-3); Vascular Endothelial Growth 
Factor B-186 ( VEGF-B 1 86), as disclosed in International Publication Number WO 

10 96/26736; Vascular Endothelial Growth Factor-D (VEGF-D), as disclosed in International 
Publication Number WO 98/02543; Vascular Endothelial Growth Factor-D (VEGF-D), as 
disclosed in International Publication Number WO 98/07832; and Vascular Endothelial 
Growth Factor-E (VEGF-E), as disclosed in German Patent Number DEI 9639601. The 
above mentioned references are incorporated herein by reference herein. 

15 In an additional embodiment, the Therapeutics of the invention are administered in 

combination with hematopoietic growth factors. Hematopoietic growth factors that may 
be administered with the Therapeutics of the invention include, but are not limited to, 
LEUKINE™ (SARGRAMOSTIM™) and NEUPOGEN™ (FILGRASTIM™). 

In an additional embodiment, the Therapeutics of the invention are administered in 

20 combination with Fibroblast Growth Factors. Fibroblast Growth Factors that may be 
administered with the Therapeutics of the invention include, but are not limited to, FGF-1, 
FGF-2, FGF-3, FGF-4, FGF-5, FGF-6, FGF-7, FGF-8, FGF-9, FGF-10, FGF-1 1, FGF-12, 
FGF-13, FGF-14, and FGF-15. 

In additional embodiments, the Therapeutics of the invention are administered in 
25 combination with other therapeutic or prophylactic regimens, such as, for example, 
radiation therapy. 



Example 24: Method o f Treating Decreased levels of the Polyp eptide 
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The present invention relates to a method for treating an individual in need of an 
increased level of a polypeptide of the invention in the body comprising administering to 
such an individual a composition comprising a therapeutically effective amount of an 
agonist of the invention (including polypeptides of the invention). Moreover, it will be 
appreciated that conditions caused by a decrease in the standard or normal expression level 
of a secreted protein in an individual can be treated by administering the polypeptide of the 
present invention, preferably in the secreted form. Thus, the invention also provides a 
method of treatment of an individual in need of an increased level of the polypeptide 
comprising administering to such an individual a Therapeutic comprising an amount of the 
polypeptide to increase the activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily dose 
0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the polypeptide is in 
the secreted form. The exact details of the dosing scheme, based on administration and 
formulation, are provided in Example 23. 

Example 2fr Method of Treating Increased I ^ e Ls of the Pn^ ^A. 

The present invention also relates to a method of treating an individual in need of 
decreased level of a polypeptide of the invention in the body comprising administering 
such an individual a composition comprising a therapeutically effective amount of an 
antagonist of the invention (including polypeptides and antibodies of the invention). 

In one example, antisense technology is used to inhibit production of a polypeptide 
of the present invention. This technology is one example of a method of decreasing levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such 
For example, a patient diagnosed with abnormally increased levels of a polypeptide 
administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg 
day for 21 days. This treatment is repeated after a 7-day rest period if the treatment was 
well tolerated. The formulation of the antisense polynucleotide is provided in Example 23 
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Efflmpfc 2fo Method of Treatment I Uinp f> ne Therapv.F.v Vi YA 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a subject 
by skin biopsy. The resulting tissue is placed in tissue-culture medium and separated into 
small pieces. Small chunks of the tissue are placed on a wet surface of a tissue culture 
flask, approximately ten pieces are placed in each flask. The flask is turned upside down, 
closed tight and left at room temperature over night. After 24 hours at room temperature, 
the flask is inverted and the chunks of tissue remain fixed to the bottom of the flask and 
fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and streptomycin) is added. 
The flasks are then incubated at 37 degree C for approximately one week. 

At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and HindUI 
and subsequently treated with calf intestinal phosphatase. The linear vector is fractionated 
on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified using 
PCR primers which correspond to the 5' and 3' end sequences respectively as set forth in 
Example 1 using primers and having appropriate restriction sites and initiation/stop 
codons, if necessary. Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a HindUI site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and HindUI fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for the 
purpose of confirming that the vector has the gene of interest properly inserted. 
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The amphotropic pA317 or GP+aml2 packaging cells arc grown in tissue culture to 
confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% calf serum 
(CS), penicillin and streptomycin. The MSV vector containing the gene is then added to 
the media and the packaging cells transduced with the vector. The packaging cells now 
5 produce infectious viral particles containing the gene (the packaging cells are now referred 
to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the media 
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing 
the infectious viral particles, is filtered through a millipore filter to remove detached 

10 producer cells and this media is then used to infect fibroblast cells. Media is removed from 
a sub-confluent plate of fibroblasts and quickly replaced with the media from the producer 
cells. This media is removed and replaced with fresh media. If the titer of virus is high, 
then virtually all fibroblasts will be infected and no selection is required. If the titer is very 
low, then it is necessary to use a retroviral vector that has a selectable marker, such as neo 

15 or his. Once the fibroblasts have been efficiently infected, the fibroblasts are analyzed to 
determine whether protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. 

20 Example 27: Gene Theranv I Ising RnHn ff < .nn,, s G enfls fWr^ponriing T n 
PoivnuclfintiHre nf %> Invgntini^ 

Another method of gene therapy according to the present invention involves 
operably associating the endogenous polynucleotide sequence of the invention with a 
promoter via homologous recombination as described, for example, in U.S. Patent NO: 
25 5,641,670, issued June 24, 1997; International Publication NO: WO 96/29411, published 
September 26, 1 996; International Publication NO: WO 94/1 2650, published August 4, 
1994; Roller et al., Proc. Natl. Acad. Sci. USA, 86:8932-8935 (1989); and Zijlstra et al., 
Nature. 342:435-438 (1989). This method involves the activation of a gene which is 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCIYUS99/25031 

517 

present in the target cells, but which is not expressed in the cells, or is expressed at a lower 
level than desired. 

Polynucleotide constructs are made which contain a promoter and targeting 
sequences, which are homologous to the 5' non-coding sequence of endogenous 
polynucleotide sequence, flanking the promoter. The targeting sequence will be 
sufficiently near the 5' end of the polynucleotide sequence so the promoter will be operably 
linked to the endogenous sequence upon homologous recombination. The promoter and 
the targeting sequences can be amplified using PCR. Preferably, the amplified promoter 
contains distinct restriction enzyme sites on the 5* and 3' ends. Preferably, the 3' end of the 
first targeting sequence contains the same restriction enzyme site as the 5' end of the 
amplified promoter and the 5' end of the second targeting sequence contains the same 
restriction site as the 3' end of the amplified promoter. 

The amplified promoter and the amplified targeting sequences are digested with the 
appropriate restriction enzymes and subsequently treated with calf intestinal phosphatase. 
The digested promoter and digested targeting sequences are added together in the presence 
of T4 DNA ligase. The resulting mixture is maintained under conditions appropriate for 
ligation of the two fragments. The construct is size fractionated on an agarose gel then 
purified by phenol extraction and ethanol precipitation. 

In this Example, the polynucleotide constructs are administered as naked 
polynucleotides via electroporation. However, the polynucleotide constructs may also be 
administered with transfection-facilitating agents, such as liposomes, viral sequences, viral 
particles, precipitating agents, etc. Such methods of delivery are known in the art. 

Once the cells are transfected, homologous recombination will take place which 
results in the promoter being operably linked to the endogenous polynucleotide sequence. 
This results in the expression of polynucleotide corresponding to the polynucleotide in the 
cell. Expression may be detected by immunological staining, or any other method known 
in the art. 
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Fibroblasts are obtained from a subject by skin biopsy. The resulting tissue is 
placed in DMEM + 10% fetal calf serum. Exponentially growing or early stationary phase 
fibroblasts are trypsinized and rinsed from the plastic surface with nutrient medium. An 
aliquot of the cell suspension is removed for counting, and the remaining cells are 
5 subjected to centrifugation. The supernatant is aspirated and the pellet is resuspended in 5 
ml of electroporation buffer (20 mM HEPES pH 7.3, 137 mM NaCl, 5 mM KCI, 0.7 mM 
Na 2 HP0 4 , 6 mM dextrose). The cells are receritrifuged, the supernatant aspirated, and the 
cells resuspended in electroporation buffer containing 1 mg/ml acetylated bovine serum 
albumin. The final cell suspension contains approximately 3X10 6 cells/ml. Electroporation 

10 should be performed immediately following resuspension. 

Plasmid DNA is prepared according to standard techniques. For example, to 
construct a plasmid for targeting to the locus corresponding to the polynucleotide of the 
invention, plasmid pUC18 (MBI Fermentas, Amherst, NY) is digested with Hindlll. The 
CMV promoter is amplified by PCR with an Xbal site on the 5' end and a BamHl site on 

15 the 3'end. Two non-coding sequences are amplified via PCR: one non-coding sequence 
(fragment 1) is amplified with a Hindlll site at the 5* end and an Xba site at the 3'end; the 
other non-coding sequence (fragment 2) is amplified with a BamHI site at the 5'end and a 
Hindlll site at the 3'end. The CMV promoter and the fragments (1 and 2) are digested with 
the appropriate enzymes (CMV promoter - Xbal and BamHI; fragment 1 - Xbal; fragment 

20 2 - BamHI) and ligated together. The resulting ligation product is digested with Hindlll, 
and ligated with the Hindlll-digested pUC18 plasmid. 

Plasmid DNA is added to a sterile cuvette with a 0.4 cm electrode gap (Bio-Rad). 
The final DNA concentration is generally at least 120 /xg/ml. 0.5 ml of the cell suspension 
(containing approximately 1.5.XI0 6 cells) is then added to the cuvette, and the cell 

25 suspension and DNA solutions are gently mixed. Electroporation is performed with a 

Gene-Puiser apparatus (Bio-Rad). Capacitance and voltage are set at 960 fi¥ and 250-300 
V, respectively. As voltage increases, cell survival decreases, but the percentage of 
surviving cells that stably incorporate the introduced DNA into their genome increases 
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dramatically. Given these parameters, a pulse time of approximately 14-20 mSec should be 
observed. 

Electroporated cells are maintained at room temperature for approximately 5 min, 
and the contents of the cuvette are then gently removed with a sterile transfer pipette. The 
cells are added directly to 10 ml of prewarmed nutrient media (DMEM with 15% calf 
serum) in a 10 cm dish and incubated at 37 degree C. The following day, the media is 
aspirated and replaced with 10 ml of fresh media and incubated for a further 16-24 hours. 

The engineered fibroblasts are then injected into the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. The fibroblasts i 
produce the protein product. The fibroblasts can then be introduced into a patient : 
described above. 



i now 

:as 



Example 28: Method of Treatment Il ^ ng C.eno Therapy - In Vi n 

Another aspect of the present invention is using in vivo gene therapy methods to 
treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences 
into an animal to increase or decrease the expression of the polypeptide. The 
polynucleotide of the present invention may be operatively linked to a promoter or any 
other genetic elements necessary for the expression of the polypeptide by the target tissue. 
Such gene therapy and delivery techniques and methods are known in the art, see, for 
example, WO90/11092, W098/11779; U.S. Patent NO. 5693622, 5705151, 5580859; 
Tabata et al., Cardiovasc. Res. 35(3):470-479 (1997); Chao et al., Pharmacol. Res. 
35(6):517-522 (1997); Wolff, Neuromuscul. Disord. 7(5):314-318 (1997); Schwartz et al., 
Gene Ther. 3(5):405-411 (1996); Tsurumi et al., Circulation 94(12):328 1-3290 (1996) 
(incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial space of 
tissues (heart, muscle, skin, lung, liver, intestine and the like). The polynucleotide 
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constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free 
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell, 
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating 
5 agents and the like. However, the polynucleotides of the present invention may also be 
delivered in liposome formulations (such as those taught in Feigner P.L. et al. (1995) Ann. 
NY Acad. Sci. 772:126-139 and Abdallah B. et al. (1995) Biol. Cell 85(l):l-7) which can 
be prepared by methods well known to those skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 

0 preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow for replication. Any strong promoter known to those skilled in the art 
can be used for driving the expression of DNA. Unlike other gene therapies techniques, 
one major advantage of introducing naked nucleic acid sequences into target cells is the 
transitory nature of the polynucleotide synthesis in the cells. Studies have shown that non- 

1 replicating DNA sequences can be introduced into cells to provide production of the 
desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of tissues 
within the an animal, including of muscle, skin, brain, lung, liver, spleen, bone marrow, 
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, 
intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective tissue. 
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue 
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by 
the plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to 
the interstitial space of muscle tissue is preferred for the reasons discussed below. They 
may be conveniently delivered by injection into the tissues comprising these cells. They 
are preferably delivered to and expressed in persistent, non-dividing cells which are 
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differentiated, although delivery and expression may be achieved in non-differentiated or 
less completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and 
express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 g/kg body weight to about 50 mg/kg body 
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and 
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of 
ordinary skill will appreciate, this dosage will vary according to the tissue site of injection. 
The appropriate and effective dosage of nucleic acid sequence can readily be determined 
by those of ordinary skill in the art and may depend on the condition being treated and the 
route of administration. The preferred route of administration is by the parenteral route of 
injection into the interstitial space of tissues. However, other parenteral routes may also be 
used, such as, inhalation of an aerosol formulation particularly for delivery to lungs or 
bronchial tissues, throat or mucous membranes of the nose. In addition, naked 
polynucleotide constructs can be delivered to arteries during angioplasty by the catheter 
used in the procedure. 

The dose response effects of injected polynucleotide in muscle in vivo is 
determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard recombinant 
DNA methodology. The template DNA, which may be either circular or linear, is either 
used as naked DNA or complexed with liposomes. The quadriceps muscles of mice are 
then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the 
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is 
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute, 
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about 
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0.2 cm deep. A suture is placed over the injection site for future localization, and the skin 
is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by 
excising the entire quadriceps. Every fifth 15 urn cross-section of the individual 
quadriceps muscles is histochemically stained for protein expression. A time course for 
protein expression may be done in a similar fashion except that quadriceps from different 
mice are harvested at different times. Persistence of DNA in muscle following injection 
may be determined by Southern blot analysis after preparing total cellular DNA and HIRT 
supematants from injected and control mice. The results of the above experimentation in 
mice can be use to extrapolate proper dosages and other treatment parameters in humans 
and other animals using naked DNA. 

Example 2 9: Transo ni c Animals. 

The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea 
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons, 
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific 
embodiment, techniques described herein or otherwise known in the art, are used to 
express polypeptides of the invention in humans, as part of a gene therapy protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of transgenic 
animals. Such techniques include, but are not limited to, pronuclear microinjection 
(Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); Carver et al., 
Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology (NY) 9:830-834 
(1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus mediated gene 
transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci., USA 82:6148-6152 
(1985)), blastocysts or embryos; gene targeting in embryonic stem cells (Thompson et al.. 
Cell 56:313-321 (1989)); electroporation of cells or embryos (Lo, 1983, Mol Cell. Biol. 
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3:1803-1814 (1983)); introduction of the polynucleotides of the invention using a gene gun 
(see, e.g., Ulmer et a]., Science 259:1745 (1993); introducing nucleic acid constructs into 
embryonic pleuripotent stem cells and transferring the stem cells back into the blastocyst; 
and sperm-mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989); etc. For a 
5 review of such techniques, see Gordon, "Transgenic Animals," Intl. Rev, Cytol. 115:171- 
229 (1989), which is incorporated by reference herein in its entirety. 

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into enucleated 
oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to quiescence 

10 (Campell et al, Nature 380:64-66 (1996); Wilmut et al., Nature 385:810-813 (1997)). 

The present invention provides for transgenic animals that carry the transgene in all 
their cells, as well as animals which carry the transgene in some, but not all their cells, i.e.. 
mosaic animals or chimeric. The transgene may be integrated as a single transgene or as 
multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail 

15 tandems. The transgene may also be selectively introduced into and activated in a 
particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al., 
Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory sequences required for 
such a cell-type specific activation will depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art. When it is desired that the polynucleotide 

20 transgene be integrated into the chromosomal site of the endogenous gene, gene targeting 
is preferred. Briefly, when such a technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal sequences, into and 
disrupting the function of the nucleotide sequence of the endogenous gene. The transgene 

25 may also be selectively introduced into a particular cell type, thus inactivating the 
endogenous gene in only that cell type, by following, for example, the teaching of Gu et al. 
(Gu et al., Science 265:103-106 (1994)). The regulatory sequences required for such a 
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cell-type specific inactivation will depend upon the particular cell type of interest, and will 
be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression of the recombinant 
gene may be assayed utilizing standard techniques. Initial screening may be accomplished 
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that 
integration of the transgene has taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (rt-PCR). 
Samples of transgenic gene-expressing tissue may also be evaluated 
immunocytochemically or immunohistochemically using antibodies specific for the 
transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more than 
one integration site in order to establish separate lines; inbreeding of separate lines in order 
to produce compound transgenics that express the transgene at higher levels because of the 
effects of additive expression of each transgene; crossing of heterozygous transgenic 
animals to produce animals homozygous for a given integration site in order to both 
augment expression and eliminate the need for screening of animals by DNA analysis; 
crossing of separate homozygous lines to produce compound heterozygous or homozygous 
lines; and breeding to place the transgene on a distinct background that is appropriate for 
an experimental model of interest. 

Transgenic animals of the invention have uses which include, but are not limited to, 
animal model systems useful in elaborating the biological function of polypeptides of the 
present invention, studying conditions and/or disorders associated with aberrant 
expression, and in screening for compounds effective in ameliorating such conditions 
and/or disorders. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 



525 



PCT/US99/25031 



Example 30: Knock-Ont Apim«i ? 

Endogenous gene expression can also be reduced by inactivating or "knocking out" 
the gene and/or its promoter using targeted homologous recombination. (E.g., see 
Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503-512 (1987); 
Thompson et al., Cell 5:313-321 (1989); each of which is incorporated by reference herein 
in its entirety). For example, a mutant, non-functional polynucleotide of the invention (or 
a completely unrelated DNA sequence) flanked by DNA homologous to the endogenous 
polynucleotide sequence (either the coding regions or regulatory regions of the gene) can 
be used, with or without a selectable marker and/or a negative selectable marker, to 
transfect cells that express polypeptides of the invention in vivo. In another embodiment, 
techniques known in the art are used to generate knockouts in cells that contain, but do not 
express the gene of interest. Insertion of the DNA construct, via targeted homologous 
recombination, results in inactivation of the targeted gene. Such approaches are 
particularly suited in research and agricultural fields where modifications to embryonic 
stem cells can be used to generate animal offspring with an inactive targeted gene {e.g., see 
Thomas & Capecchi 1987 and Thompson 1989, supra). However this approach can be 
routinely adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that will 
be apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered to 
express the polypeptides of the invention, or alternatively, that are genetically engineered 
not to express the polypeptides of the invention (e.g., knockouts) are administered to a 
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including 
human) or an MHC compatible donor and can include, but are not limited to fibroblasts, 
bone marrow cells, blood cells fe^, lymphocytes), adipocytes, muscle cells, endothelial 
cells etc. The cells are genetically engineered in vitro using recombinant DNA techniques 
to introduce the coding sequence of polypeptides of the invention into the cells, or 
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alternatively, to disrupt the coding sequence and/or endogenous regulatory sequence 
associated with the polypeptides of the invention, e^, by transduction (using viral vectors, 
and preferably vectors that integrate the transgene into the cell genome) or transfection 
procedures, including, but not limited to, the use of plasmids, cosmids, YACs, naked 
5 DNA, electroporation, liposomes, etc. The coding sequence of the polypeptides of the 
invention can be placed under the control of a strong constitutive or inducible promoter or 
promoter/enhancer to achieve expression, and preferably secretion, of the polypeptides of 
the invention. The engineered cells which express and preferably secrete the polypeptides 
of the invention can be introduced into the patient systemically, e.g., in the circulation, or 
10 intraperitoneally. 

Alternatively, the cells can be incorporated into a matrix and implanted in the body, 
fi*. genetically engineered fibroblasts can be implanted as part of a skin graft; genetically 
engineered endothelial cells can be implanted as part of a lymphatic or vascular graft. 
(See, for example, Anderson et al. U.S. Patent No. 5,399,349; and Mulligan & Wilson, 
15 U.S. Patent No : 5,460,959 each of which is incorporated by reference herein in its 
entirety). 

When the cells to be administered are non-autologous or non-MHC compatible 

cells, they can be administered using well known techniques which prevent the 

development of a host immune response against the introduced cells. For example, the 
20 cells may be introduced in an encapsulated form which, while allowing for an exchange of 

components with the immediate extracellular environment, does not allow the introduced 

cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, but 

are not limited to, animal model systems useful in elaborating the biological function of 
25 polypeptides of the present invention, studying conditions and/or disorders associated with 

aberrant expression, and in screening for compounds effective in ameliorating such 

conditions and/or disorders. 
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Emmpte 31 , Twrtatjow of antibody fraPments dirgrfpH m , ains t pnlyp ^ptirt^ n f th„ 
invention frnm a library of srFvs. 

Naturally occurring V-genes isolated from human PBLs are constructed into a 
large library of antibody fragments which contain reactivities against a polypeptide 
5 having the amino acid sequence of SEQ ID NO:Y to which the donor may or may not 
have been exposed (see e.g., U.S. Patent 5,885,793 incorporated herein in its entirety by 
reference). 

Rescue of the library. 

A library of scFvs is constructed from the RNA of human PBLs as described in 

10 WO92/01047. To rescue phage displaying antibody fragments, approximately 10" E. 
coli harboring the phagemid are used to inoculate 50 ml of 2x TY containing 1% 
glucose and 100 micrograms/ml of ampicillin (2xTY-AMP-GLU) and grown to an O.D. 
of 0.8 with shaking. Five ml of this culture is used to inoculate 50 ml of 2xTY-AMP- 
GLU, 2 x 108 TU of delta gene 3 helper (M13 delta gene III, see WO92/01047) are 

15 added and the culture incubated at 37'C for 45 minutes without shaking and then at 
37°C for 45 minutes with shaking. The culture is centrifuged at 4000 r.p.m. for 10 min. 
and the pellet resuspended in 2 liters of of 2x TY containing 100 micrograms/ml 
ampicillin and 50 micrograms/ml kanamycin and grown overnight. Phage are prepared 
as described in WO92/01047. 

20 M13 delta gene III is prepared as follows: M13 delta gene III helper phage does 

not encode gene III protein, hence the phage(mid) displaying antibody fragments have a 
greater avidity of binding to antigen. Infectious M13 delta gene III particles are made by 
growing the helper phage in cells harboring a pUC19 derivative supplying the wild type 
gene III protein during phage morphogenesis. The culture is incubated for 1 hour at 

25 37°C without shaking and then for a further hour at 37°C with shaking. Cells were spun 
down (IEC-Centra 8, 4000 revs/min for 10 min), resuspended in 300 ml 2x TY broth 
containing 100 micrograms ampicillin/ml and 25 micrograms kanamycin/ml (2x TY- 
AMP-KAN) and grown overnight, shaking at 37'C. Phage particles are purified and 
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concentrated from theculture medium by two PEG-precipitations (Sambrook et al., 
1990), resuspended in 2 ml PBS and passed through a 0.45 micrometer filter (Minisart 
NML; Sartorius) to give a final concentration of approximately 10" transducing 
units/ml (ampicillin-resistant clones). 
Panning the Library. 

Immunotubes (Nunc) are coated overnight in PBS with 4 ml of either 100 
micrograms/ml or 10 micrograms/ml of a polypeptide of the present invention. Tubes 
are blocked with 2% Marvel-PBS for 2 hours at 37°C and then washed 3 times in PBS. 
Approximately 10" TU of phage is applied to the tube and incubated for 30 minutes at 
room temperature tumbling on an over and under turntable and then left to stand for 
another 1.5 hours. Tubes are washed 10 times with PBS 0.1% Tween-20 and 10 times 
with PBS. Phage are eluted by adding 1 ml of 100 mM triethylamine and rotating 15 
minutes on an under and over turntable after which the solution is immediately 
neutralized with 0.5 ml of 1.0M Tris-HCl, pH 7.4. Phage are then used to infect 10 ml 
of mid-log E. coli TGI by incubating eluted phage with bacteria for 30 minutes at 37°C. 
The E. coli are then plated on TYE plates containing 1% glucose and 100 
micrograms/ml ampicillin. The resulting bacterial library is then rescued with delta 
gene 3 helper phage as described above to prepare phage for a subsequent round of 
selection. This process is then repeated for a total of 4 rounds of affinity purification 
with tube-washing increased to 20 times with PBS, 0. 1% Tween-20 and 20 times with 
PBS for rounds 3 and 4. 

Characterization of Binders. 

Eluted phage from the third and fourth rounds of selection are used to infect E. coli 
HB 2151 and soluble scFv is produced (Marks, et al., 1991) from single colonies for 
assay. ELISAs are performed with microliter plates coated with either 10 picograms/ml of 
the polypeptide of the present invention in 50 mM bicarbonate pH 9.6. Clones positive in 
ELISA are further characterized by PCR fingerprinting (see e.g., WO92/01047) and then 
by sequencing. 
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Eyampte 32; Assays Detecting Stimulation or Inhibit!™ o f B cell Proliferate „nH 
Differentiation 

Generation of functional humoral immune responses requires both soluble and cognate 
5 signaling between B-lineage cells and their microenvironment. Signals may impart a positive 
stimulus that allows a B-lineage cell to continue its programmed development, or a negative 
stimulus that instructs the cell to arrest its current developmental pathway. To dale, numerous 
stimulatory and inhibitory signals have been found to influence B cell responsiveness including 
IL-2, IL-4, IL-5, IL-6, IL-7, IL10, IL-13, IL-14 and IL-15. Interestingly, these signals are by 

10 themselves weak effectors but can, in combination with various co-stimulatory proteins, induce 
activation, proliferation, differentiation, homing, tolerance and death among B cell populations. 

One of the best studied classes of B-cell co-stimulatory proteins is the TNF- 
superfamily. Within this family CD40, CD27, and CD30 along with their respective ligands 
CD154, CD70, and CD153 have been found to regulate a variety of immune responses. Assays 

1 5 which allow for the detection and/or observation of the proliferation and differentiation of these 
B-cell populations and their precursors are valuable tools in determining the effects various 
proteins may have on these B-cell populations in terms of proliferation and differentiation. 
Listed below are two assays designed to allow for the detection of the differentiation, 
proliferation, or inhibition of B-cell populations and their precursors. 

20 In Vitro Assay- Purified polypeptides of the invention, or truncated forms thereof, 

is assessed for its ability to induce activation, proliferation, differentiation or inhibition 
and/or death in B-cell populations and their precursors. The activity of the polypeptides of 
the invention on purified human tonsillar B cells, measured qualitatively over the dose 
range from 0.1 to 10,000 ng/mL, is assessed in a standard B-lymphocyte co-stimulation 

25 assay in which purified tonsillar B cells are cultured in the presence of either formalin- 
fixed Staphylococcus aureus Cowan I (SAC) or immobilized anti-human IgM antibody as 
the priming agent. Second signals such as IL-2 and IL-15 synergize with SAC and IgM 
crosslinking to elicit B cell proliferation as measured by tritiated-thymidine incorporation. 
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Novel synergizing agents can be readily identified using this assay. The assay involves 
isolating human tonsillar B cells by magnetic bead (MACS) depletion of CD3-positive 
cells. The resulting cell population is greater than 95% B cells as assessed by expression 
ofCD45R(B220). 

Various dilutions of each sample are placed into individual wells of a 96-well plate 
to which are added 10 5 B-cells suspended in culture medium (RPMI 1640 containing 10% 
FBS, 5 X 10" 5 M 2ME, lOOU/ml penicillin, lOug/ml streptomycin, and 10" 5 dilution of 
SAC) in a total volume of 150ul. Proliferation or inhibition is quantitated by a 20h pulse 
(luCi/well) with 3H-thymidine (6.7 Ci/mM) beginning 72h post factor addition. The 
positive and negative controls are IL2 and medium respectively. 

In Vivo Assay- BALB/c mice are injected (i.p.) twice per day with buffer only, or 
2 mg/Kg of a polypeptide of the invention, or truncated forms thereof. Mice receive this 
treatment for 4 consecutive days, at which time they are sacrificed and various tissues and 
serum collected for analyses. Comparison of H&E sections from normal spleens and 
spleens treated with polypeptides of the invention identify the results of the activity of the 
polypeptides on spleen cells, such as the diffusion of peri-arterial lymphatic sheaths, and/or 
significant increases in the nucleated cellularity of the red pulp regions, which may 
indicate the activation of the differentiation and proliferation of B-cell populations. 
Immunohistochemical studies using a B cell marker, anti-CD45R(B220), are used to 
determine whether any physiological changes to splenic cells, such as splenic 
disorganization, are due to increased B-cell representation within loosely defined B-cell 
zones that infiltrate established T-cell regions. 

Flow cytometric analyses of the spleens from mice treated with polypeptide is used 
to indicate whether the polypeptide specifically increases the proportion of ThB+, 
CD45R(B220)dull B cells over that which is observed in control mice. 

Likewise, a predicted consequence of increased mature B-cell representation in 
vivo is a relative increase in serum Ig titers. Accordingly, serum IgM and IgA levels are 
compared between buffer and polypeptide-treated mice. 
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The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides of the invention (e.g., gene therapy), agonists, and/or 
antagonists of polynucleotides or polypeptides of the invention. 

5 

Example 33: T Col l Proliferation /U^ y 

A CD3-induced proliferation assay is performed on PBMCs and is measured by the 
uptake of 3 H-thymidine. The assay is performed as follows. Ninety-six well plates are 
coated with 100 ul/well of mAb to CD3 (HIT3a, Pharmingen) or isotype-matched control 
10 mAb (B33.1) overnight at 4 degrees C (1 ug/ml in .05M bicarbonate buffer, pH 9.5), then 
washed three times with PBS. PBMC are isolated by F/H gradient centrifugation from 
human peripheral blood and added to quadruplicate wells (5 x lOVwell) of mAb coated 
plates in RPMI containing 10% FCS and P/S in the presence of varying concentrations of 
polypeptides of the invention (total volume 200 ul). Relevant protein buffer and medium 

15 alone are controls. After 48 hr. culture at 37 degrees C, plates are spun for 2 min. at 1000 
rpm and 100 ul of supernatant is removed and stored -20 degrees C for measurement of 
IL-2 (or other cytokines) if effect on proliferation is observed. Wells are supplemented 
with 100 ul of medium containing 0.5 uCi of 3 H-thymidine and cultured at 37 degrees C 
for 18-24 hr. Wells are harvested and incorporation of 3 H-lhymidine used as a measure of 

20 proliferation. Anti-CD3 alone is the positive control for proliferation. IL-2 (100 U/ml) is 
also used as a control which enhances proliferation. Control antibody which does not 
induce proliferation of T cells is used as the negative controls for the effects of 
polypeptides of the invention. 

The studies described in this example tested activity of polypeptides of the 

25 invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides of the invention (e.g., gene therapy), agonists, and/or 
antagonists of polynucleotides or polypeptides of the invention. 
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Example 34; Fffrrt of Polype ptide-* of the Invention nn th» Exp ression nfMHP r, ayf 
tt, CQStimulatorY and Adhesjon Mo lecules and C ell Differentia™ o f Monocytes g ^ 
Mpnocyte-Derived Human r>enHrfti«. r~\fc 

Dendritic cells are generated by the expansion of proliferating precursors found in 
5 the peripheral blood: adherent PBMC or elutriated monocytic fractions are cultured for 7- 
10 days with GM-CSF (50 ng/ml) and IL-4 (20 ng/ml). These dendritic cells have the 
characteristic phenotype of immature cells (expression of CD!, CD80, CD86, CD40 and 
MHC class II antigens). Treatment with activating factors, such as TNF-a, causes a rapid 
change in surface phenotype (increased expression of MHC class I and II, costimulatory 
10 and adhesion molecules, downregulation of FCyRII, upregulation of CD83). These 
changes correlate with increased antigen-presenting capacity and with functional 
maturation of the dendritic cells. 

FACS analysis of surface antigens is performed as follows. Cells are treated 1-3 
days with increasing concentrations of polypeptides of the invention or LPS (positive 
15 control), washed with PBS containing 1% BSA and 0.02 mM sodium azide, and then 
incubated with 1:20 dilution of appropriate FITC- or PE-Iabeled monoclonal antibodies for 
30 minutes at 4 degrees C. After an additional wash, the labeled cells are analyzed by flow 
cytometry on a FACScan (Becton Dickinson). 



20 



Effect on the production of cytokines . Cytokines generated by dendritic cells, in 
particular IL-12, are important in the initiation of T-cell dependent immune responses. IL- 
12 strongly influences the development of Thl helper T-cell immune response, and induces 
cytotoxic T and NK cell function. An ELISA is used to measure the IL-12 release as 
follows. Dendritic cells (lOVml) are treated with increasing concentrations of polypeptides 
25 of the invention for 24 hours. LPS (100 ng/ml) is added to the cell culture as positive 
control. Supernatants from the cell cultures are then collected and analyzed for IL-12 
content using commercial ELISA kit (e..g, R&D Systems (Minneapolis, MN)). The 
standard protocols provided with the kits are used. 
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Effect on the expression of MHC Class II, costimulatory and adhesion molecules. 
Three major families of cell surface antigens can be identified on monocytes: adhesion 
molecules, molecules involved in antigen presentation, and Fc receptor. Modulation of the 
expression of MHC class II antigens and other costimulatory molecules, such as B7 and 
ICAM-1, may result in changes in the antigen presenting capacity of monocytes and ability 
to induce T cell activation. Increase expression of Fc receptors may correlate with 
improved monocyte cytotoxic activity, cytokine release and phagocytosis. 

FACS analysis is used to examine the surface antigens as follows. Monocytes are 
treated 1-5 days with increasing concentrations of polypeptides of the invention or LPS 
(positive control), washed with PBS containing 1% BSA and 0.02 mM sodium azide, and 
then incubated with 1:20 dilution of appropriate FITC- or PE-labeled monoclonal 
antibodies for 30 minutes at 4 degreesC. After an additional wash, the labeled cells are 
analyzed by flow cytometry on a FACScan (Becton Dickinson). 

Monocyte activation and/or increased survival Assays for molecules that activate 
(or alternatively, inactivate) monocytes and/or increase monocyte survival (or 
alternatively, decrease monocyte survival) are known in the art and may routinely be 
applied to determine whether a molecule of the invention functions as an inhibitor or 
activator of monocytes. Polypeptides, agonists, or antagonists of the invention can be 
screened using the three assays described below. For each of these assays, Peripheral 
blood mononuclear cells (PBMC) are purified from single donor leukopacks (American 
Red Cross, Baltimore, MD) by centrifugation through a Histopaque gradient (Sigma). 
Monocytes are isolated from PBMC by counterflow centrifugal elutriation. 

Monocyte Survival Assay. Human peripheral blood monocytes progressively lose 
viability when cultured in absence of serum or other stimuli. Their death results from 
internally regulated process (apoptosis). Addition to the culture of activating factors, such 
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as TNF-alpha dramatically improves *ell survival and prevents DNA fragmentation. 
Propidium iodide (PI) staining is used to measure apoptosis as follows. Monocytes are 
cultured for 48 hours in polypropylene tubes in serum-free medium (positive control), in 
the presence of 100 ng/ml TNF-alpha (negative control), and in the presence of varying 
concentrations of the compound to be tested. Cells are suspended at a concentration of 2 x 
107ml in PBS containing PI at a final concentration of 5 ug/ml, and then incubaed at room 
temperature for 5 minutes before FACScan analysis. PI uptake has been demonstrated to 
correlate with DNA fragmentation in this experimental paradigm. 

Effect on cytokine release An important function of monocytes/macrophages is 
their regulatory activity on other cellular populations of the immune system through the 
release of cytokines after stimulation. An ELISA to measure cytokine release is performed 
as follows. Human monocytes are incubated at a density of 5x10* cells/ml with increasing 
concentrations of the a polypeptide of the invention and under the same conditions, but in 
the absence of the polypeptide. For IL-12 production, the cells are primed overnight with 
IFN (100 U/ml) in presence of a polypeptide of the invention. LPS (10 ng/ml) is then 
added. Conditioned media are collected after 24h and kept frozen until use. Measurement 
of TNF-alpha, IL-10, MCP-1 and IL-8 is then performed using a commercially available 
ELISA kit (e..g, R&D Systems (Minneapolis, MN)) and applying the standard protocols 
provided with the kit. 

Oxidative burst. Purified monocytes are plated in 96-w plate at 2-lxlO 5 cell/well. 
Increasing concentrations of polypeptides of the invention are added to the wells in a total 
volume of 0.2 ml culture medium (RPMI 1640 + 10% FCS, glutamine and antibiotics). 
After 3 days incubation, the plates are centrifuged and the medium is removed from the 
wells. To the macrophage monolayers, 0.2 ml per well of phenol red solution (140 mM 
NaCl, 10 mM potassium phosphate buffer pH 7.0, 5.5 mM dextrose, 0.56 mM phenol red 
and 19 U/ml of HRPO) is added, together with the stimulant (200 nM PMA). The plates 

SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

535 

are incubated at 37°C for 2 hours and the reaction is stopped by adding 20 fi\ IN NaOH 
per well. The absorbance is read at 610 nm. To calculate the amount of H 2 0 2 produced by 
the macrophages, a standard curve of a H 2 0 2 solution of known molarity is performed for 
each experiment. 

5 The studies described in this example tested activity of a polypeptide of the 

invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polypeptides, polynucleotides (e.g., gene therapy), agonists, and/or 
antagonists of the invention. 

10 Example 35: Biological Effects of Polypeptides of the Invention 

Astrocyte and Neuronal Assays - 

Recombinant polypeptides of the invention, expressed in Escherichia coli and 
purified as described above, can be tested for activity in promoting the survival, neurite 
outgrowth, or phenotypic differentiation of cortical neuronal cells and for inducing the 

] 5 proliferation of glial fibrillary acidic protein immunopositive cells, astrocytes. The 
selection of cortical cells for the bioassay is based on the prevalent expression of FGF-1 
and FGF-2 in cortical structures and on the previously reported enhancement of cortical 
neuronal survival resulting from FGF-2 treatment. A thymidine incorporation assay, for 
example, can be used to elucidate a polypeptide of the invention's activity on these cells. 

20 Moreover, previous reports describing the biological effects of FGF-2 (basic FGF) 

on cortical or hippocampal neurons in vitro have demonstrated increases in both neuron 
survival and neurite outgrowth (Walicke et al., "Fibroblast growth factor promotes survival 
of dissociated hippocampal neurons and enhances neurite extension." Proc. Natl. Acad 
ScL USA 53:3012-3016. (1986), assay herein incorporated by reference in its entirety). 

25 However, reports from experiments done on PC- 12 cells suggest that these two responses 
are not necessarily synonymous and may depend on not only which FGF is being tested 
but also on which receptor(s) are expressed on the target cells. Using the primary cortical 
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neuronal culture paradigm, the ability of a polypeptide of the invention to induce neurite 
outgrowth can be compared to the response achieved with FGF-2 using, for example, a 
thymidine incorporation assay. 



Fibroblast and endothelial cell assays - 

Human lung fibroblasts are obtained from Clonetics (San Diego, CA) and 
maintained in growth media from Clonetics. Dermal microvascular endothelial cells are 
obtained from Cell Applications (San Diego, CA). For proliferation assays, the human 
lung fibroblasts and dermal microvascular endothelial cells can be cultured at 5,000 
cells/well in a 96-well plate for one day in growth medium. The cells are then incubated 
for one day in 0. 1 % BS A basal medium. After replacing the medium with fresh 0. 1 % BSA 
medium, the cells are incubated with the test proteins for 3 days. Alamar Blue (Alamar 
Biosciences, Sacramento, CA) is added to each well to a final concentration of 10%. The 
cells are incubated for 4 hr. Cell viability is measured by reading in a CytoFluor 
fluorescence reader. For the PGE, assays, the human lung fibroblasts are cultured at 5,000 
cells/well in a 96-well plate for one day. After a medium change to 0.1% BSA basal 
medium, the cells are incubated with FGF-2 or polypeptides of the invention with or 
without IL-la for 24 hours. The supernatants are collected and assayed for PGE, by EIA 
kit (Cayman, Ann Arbor, MI). For the IL-6 assays, the human lung fibroblasts are cultured 
at 5,000 cells/well in a 96-well plate for one day. After a medium change to 0.1% BSA 
basal medium, the cells are incubated with FGF-2 or with or without polypeptides of the 
invention IL-la for 24 hours. The supernatants are collected and assayed for IL-6 by 
ELISA kit (Endogen, Cambridge, MA). 

Human lung fibroblasts are cultured with FGF-2 or polypeptides of the invention 
for 3 days in basal medium before the addition of Alamar Blue to assess effects on growth 
of the fibroblasts. FGF-2 should show a stimulation at 10 - 2500 ng/ml which can be used 
to compare stimulation with polypeptides of the invention. 
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Parkinson Models . 

The loss of motor function in Parkinson's disease is attributed to a deficiency of 
striatal dopamine resulting from the degeneration of the nigrostriatal dopaminergic 
projection neurons. An animal model for Parkinson's that has been extensively 
characterized involves the systemic administration of l-methyl-4 phenyl 1,2,3,6- 
tetrahydropyridine (MPTP). In the CNS, MPTP is taken-up by astrocytes and catabolized 
by monoamine oxidase B to l-methyl-4-phenyl pyridine (MPP + ) and released. 
Subsequently, MPP + is actively accumulated in dopaminergic neurons by the high-affinity 
reuptake transporter for dopamine. MPP + is then concentrated in mitochondria by the 
electrochemical gradient and selectively inhibits nicotidamide adenine disphosphate: 
ubiquinone oxidoreductionase (complex 1), thereby interfering with electron transport and 
eventually generating oxygen radicals. 

It has been demonstrated in tissue culture paradigms that FGF-2 (basic FGF) has 
trophic activity towards nigral dopaminergic neurons (Ferrari et al„ Dev. Biol. 1989). 
Recently, Dr. Unsicker's group has demonstrated that administering FGF-2 in gel foam 
implants in the striatum results in the near complete protection of nigral dopaminergic 
neurons from the toxicity associated with MPTP exposure (Otto and Unsicker, J. 
Neuroscience, 1990). 

Based on the data with FGF-2, polypeptides of the invention can be evaluated to 
determine whether it has an action similar to that of FGF-2 in enhancing dopaminergic 
neuronal survival in vitro and it can also be tested in vivo for protection of dopaminergic 
neurons in the striatum from the damage associated with MPTP treatment. The potential 
effect of a polypeptide of the invention is first examined in vitro in a dopaminergic 
neuronal ceil culture paradigm. The cultures are prepared by dissecting the midbrain floor 
plate from gestation day 14 Wistar rat embryos. The tissue is dissociated with trypsin and 
seeded at a density of 200,000 cells/cm 2 on polyorthinine-laminin coated glass coverslips. 
The cells are maintained in Dulbecco's Modified Eagle's medium and F12 medium 
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containing hormonal supplements (Nl). The cultures are fixed with paraformaldehyde 
after 8 days in vitro and are processed for tyrosine hydroxylase, a specific marker for 
dopminergic neurons, immunohistochemical staining. Dissociated cell cultures are 
prepared from embryonic rats. The culture medium is changed every third day and the 
factors are also added at that time. 

Since the dopaminergic neurons are isolated from animals at gestation day 14, a 
developmental time which is past the stage when the dopaminergic precursor cells are 
proliferating, an increase in the number of tyrosine hydroxylase immunopositive neurons 
would represent an increase in the number of dopaminergic neurons surviving in vitro. 
Therefore, if a polypeptide of the invention acts to prolong the survival of dopaminergic 
neurons, it would suggest that the polypeptide may be involved in Parkinson's Disease. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 3ft The Effect of Polypeptides nf Hu> In v ention nn >Hp r.r Qwt h of V a ^„lar 
Endothelial CVII^ 

On day 1, human umbilical vein endothelial cells (HUVEC) are seeded at 2-5xl0 4 
cells/35 mm dish density in M199 medium containing 4% fetal bovine serum (FBS), 16 
units/ml heparin, and 50 units/ml endothelial cell growth supplements (ECGS, 
Biotechnique, Inc.). On day 2, the medium is replaced with M199 containing 10% FBS, 8 
units/ml heparin. A polypeptide having the amino acid sequence of SEQ ID NO:Y, and 
positive controls, such as VEGF and basic FGF (bFGF) are added, at varying 
concentrations. On days 4 and 6, the medium is replaced. On day 8, cell number is 
determined with a Coulter Counter. 

An increase in the number of HUVEC cells indicates that the polypeptide of the 
invention may proliferate vascular endothelial cells. 
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The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 37; Stimulatory Effect of Poly p eptides of the Invention on the Proliferati on 
of Vascula r Endothelial Q lk 

For evaluation of mitogenic activity of growth factors, the colorimetric MTS 
(3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)2H- 
tetrazolium) assay with the electron coupling reagent PMS (phenazine methosulfate) was 
performed (CellTiter 96 AQ, Promega). Cells are seeded in a 96-well plate (5,000 
cells/well) in 0.1 mL serum-supplemented medium and are allowed to attach overnight. 
After serum-starvation for 12 hours in 0.5% FBS, conditions (bFGF, VEGF I6J or a 
polypeptide of the invention in 0.5% FBS) with or without Heparin (8 U/ml) are added to 
wells for 48 hours. 20 mg of MTS/PMS mixture (1:0.05) are added per well and allowed 
to incubate for 1 hour at 37°C before measuring the absorbance at 490 nm in an ELISA 
plate reader. Background absorbance from control wells (some media, no cells) is 
subtracted, and seven wells are performed in parallel for each condition. See, Leak el al. 
In Vitro Cell. Dev. Biol. J0A/512-518 (1994). 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 38: Inhibition of PDGF-induced V ascular Smo o t h Muscle Cell Proliferation 
Stimulatory Effect 

HAoSMC proliferation can be measured, for example, by BrdUrd incorporation. 
Briefly, subconfluent, quiescent cells grown on the 4-chamber slides are transfected with 
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CRP or FITC-labeled AT2-3LP. Then, the cells are pulsed with 10% calf serum and 6 
mg/ml BrdUrd. After 24 h, immunocytochemistry is performed by using BrdUrd Staining 
Kit (Zymed Laboratories). In brief, the cells are incubated with the biotinylated mouse 
anti-BrdUrd antibody at 4 degrees C for 2 h after being exposed to denaturing solution and 
then incubated with the streptavidin-peroxidase and diaminobenzidine. After 
counterstaining with hematoxylin, the cells are mounted for microscopic examination, and 
the BrdUrd-positive cells are counted. The BrdUrd index is calculated as a percent of the 
BrdUrd-positive cells to the total cell number. In addition, the simultaneous detection of 
the BrdUrd staining (nucleus) and the FITC uptake (cytoplasm) is performed for individual 
cells by the concomitant use of bright field illumination and dark field-UV fluorescent 
illumination. See, Hayashida et al., J. Biol. Chem. 6:271(36):21985-21992 (1996). 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 39: Stimulate of Endothelial Migration 

This example will be used to explore the possibility that a polypeptide of the 
invention may stimulate lymphatic endothelial cell migration. 

Endothelial cell migration assays are performed using a 48 well microchemotaxis 
chamber (Neuroprobe Inc., Cabin John, MD; Falk, W., et al., J. Immunological Methods 
1980;33:239-247). Polyvinyipyrrolidone-free polycarbonate filters with a pore size of 8 
um (Nucleopore Corp. Cambridge, MA) are coated with 0.1% gelatin for at least 6 hours at 
room temperature and dried under sterile air. Test substances are diluted to appropriate 
concentrations in M199 supplemented with 0.25% bovine serum albumin (BSA), and 25 ul 
of the final dilution is placed in the lower chamber of the modified Boyden apparatus. 
Subconfluent, early passage (2-6) HUVEC or BMEC cultures are washed and trypsinized 
for the minimum time required to achieve cell detachment. After placing the filter between 
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lower and upper chamber, 2.5 x 10 s cells suspended in 50 ul M199 containing 1% FBS are 
seeded in the upper compartment. The apparatus is then incubated for 5 hours at 37°C in a 
humidified chamber with 5% C02 to allow cell migration. After the incubation period, the 
filter is removed and the upper side of the filter with the non-migrated cells is scraped with 
5 a rubber policeman. The filters are fixed with methanol and stained with a Giemsa 
solution (Diff-Quick, Baxter, McGraw Park, IL). Migration is quantified by counting cells 
of three random high-power fields (40x) in each well, and all groups are performed in 
quadruplicate. 

The studies described in this example tested activity of a polypeptide of the 
0 invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 40: Stimulation of Nitric Oxide Production bv Endothelial (Vlk 

Nitric oxide released by the vascular endothelium is believed to be a mediator of 
vascular endothelium relaxation. Thus, activity of a polypeptide of the invention can be 
assayed by determining nitric oxide production by endothelial cells in response to the 
polypeptide. 

Nitric oxide is measured in 96-well plates of confluent microvascular endothelial 
cells after 24 hours starvation and a subsequent 4 hr exposure to various levels of a positive 
control (such as VEGF-1) and the polypeptide of the invention. Nitric oxide in the 
medium is determined by use of the Griess reagent to measure total nitrite after reduction 
of nitric oxide-derived nitrate by nitrate reductase. The effect of the polypeptide of the 
invention on nitric oxide release is examined on HUVEC. 

Briefly, NO release from cultured HUVEC monolayer is measured with a NO- 
specific polarographic electrode connected to a NO meter (Iso-NO, World Precision 
Instruments Inc.) (1049). Calibration of the NO elements is performed according to the 
following equation: 
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2KN0 2 + 2KI+2H 2 S0 4 6 2NO + I 2 + 2H 2 0 + 2K 2 S0 4 
The standard calibration curve is obtained by adding graded concentrations of 
KN0 2 (0, 5, 10, 25, 50, 100, 250, and 500 nmol/L) into the calibration solution containing 
KI and H 2 S0 4 . The specificity of the Iso-NO electrode to NO is previously determined by 
5 measurement of NO from authentic NO gas (1050). The culture medium is removed and 
HUVECs are washed twice with Dulbecco's phosphate buffered saline. The cells are then 
bathed in 5 ml of filtered Krebs-Henseleit solution in 6-well plates, and the cell plates are 
kept on a slide warmer (Lab Line Instruments Inc.) To maintain the temperature at 37°C. 
The NO sensor probe is inserted vertically into the wells, keeping the tip of the electrode 2 
10 mm under the surface of the solution, before addition of the different conditions. S-nitroso 
acetyl penicillamin (SNAP) is used as a positive control. The amount of released NO is 
expressed as picomoles per 1x10' endothelial cells. All values reported are means of four 
to six measurements in each group (number of cell culture wells). See, Leak el al. 
Biochem. and Biophys. Res. Comm. 21 7:96- 1 05 ( 1 995). 
15 The studies described in this example tested activity of polypeptides of the 

invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



20 g^"»Ple4l: Effect of Polypfpides of the Inv-ntinn m r^ d Formation in 
Angiopenesis 

Another step in angiogenesis is cord formation, marked by differentiation of 
endothelial cells. This bioassay measures the ability of microvascular endothelial cells to 
form capillary-like structures (hollow structures) when cultured in vitro. 

CADMEC (microvascular endothelial cells) are purchased from Cell Applications, 
Inc. as proliferating (passage 2) cells and are cultured in Cell Applications' CADMEC 
Growth Medium and used at passage 5. For the in vitro angiogenesis assay, the wells of a 
48-well cell culture plate are coated with Cell Applications' Attachment Factor Medium 
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(200 ml/well) for 30 min. at 37°C. CADMEC are seeded onto the coated wells at 7,500 
cells/well and cultured overnight in Growth Medium. The Growth Medium is then 
replaced with 300 mg Cell Applications' Chord Formation Medium containing control 
buffer or a polypeptide of the invention (0.1 to 100 ng/ml) and the cells are cultured for an 
5 additional 48 hr. The numbers and lengths of the capillary-like chords are quantitated 
through use of the Boeckeler VIA-170 video image analyzer. All assays are done in 
triplicate. 

Commercial (R&D) VEGF (50 ng/ml) is used as a positive control, b-esteradiol (1 
ng/ml) is used as a negative control. The appropriate buffer (without protein) is also 
10 utilized as a control. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

15 

Example 42: Angiogenic Effect nn Chick Chor ioallantoic Memhranp 

Chick chorioallantoic membrane (CAM) is a well-established system to examine 
angiogenesis. Blood vessel formation on CAM is easily visible and quantifiable. The 
ability of polypeptides of the invention to stimulate angiogenesis in CAM can be 
20 examined. 

Fertilized eggs of the White Leghorn chick {Gallus gallus) and the Japanese qual 
(Coturnix coturnix) are incubated at 37.8°C and 80% humidity. Differentiated CAM of 
16-day-old chick and 13-day-old qual embryos is studied with the following methods. 

On Day 4 of development, a window is made into the egg shell of chick eggs. The 
25 embryos are checked for normal development and the eggs sealed with cellotape. They are 
further incubated until Day 13. Thermanox coverslips (Nunc, Naperville, IL) are cut into 
disks of about 5 mm in diameter. Sterile and salt-free growth factors are dissolved in 
distilled water and about 3.3 mg/ 5 ml are pipetted on the disks. After air-drying, the 
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inverted disks are applied on CAM. After 3 days, the specimens are fixed in 3% 
glutaraldehyde and 2% formaldehyde and rinsed in 0.12 M sodium cacodylate buffer. 
They are photographed with a stereo microscope [Wild M8] and embedded for semi- and 
ultrathin sectioning as described above. Controls are performed with carrier disks alone. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



10 Example Anagenesis Assay UsinP a l mD i anf in Mn.,c» 

In vivo angiogenesis assay of a polypeptide of the invention measures the ability of 
an existing capillary network to form new vessels in an implanted capsule of murine 
extracellular matrix material (Matrigel). The protein is mixed with the liquid Matrigel at 4 
degree C and the mixture is then injected subcutaneously in mice where it solidifies. After 
15 7 days, the solid "plug" of Matrigel is removed and examined for the presence of new 
blood vessels. Matrigel is purchased from Becton Dickinson Labware/Collaborative 
Biomedical Products. 

When thawed at 4 degree C the Matrigel material is a liquid. The Matrigel is 
mixed with a polypeptide of the invention at 150 ng/ml at 4 degrees C and drawn into cold 

20 3 mi syringes. Female C57B1/6 mice approximately 8 weeks old are injected with the 
mixture of Matrigel and experimental protein at 2 sites at the midventral aspect of the 
abdomen (0.5 ml/site). After 7 days, the mice are sacrificed by cervical dislocation, the 
Matrigel plugs are removed and cleaned (i.e., all clinging membranes and fibrous tissue is 
removed). Replicate whole plugs are fixed in neutral buffered 10% formaldehyde, 

25 embedded in paraffin and used to produce sections for histological examination after 
staining with Masson's Trichrome. Cross sections from 3 different regions of each plug 
are processed. Selected sections are stained for the presence of vWF. The positive control 
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for this assay is bovine basic FGF (150 ng/ml). Matrigel alone is used to determine basal 
levels of angiogenesis. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 44; Rescue of Ischemia in Rnhh j t \, nwer i ,imh M nH»i 

To study the in vivo effects of polynucleotides and polypeptides of the invention on 
ischemia, a rabbit hindlimb ischemia model is created by surgical removal of one femoral 
arteries as described previously (Takeshita et al.,Am J. Pathol 747:1649-1660 (1995)). 
The excision of the femoral artery results in retrograde propagation of thrombus and 
occlusion of the external iliac artery. Consequently, blood flow to the ischemic limb is 
dependent upon collateral vessels originating from the internal iliac artery (Takeshitaer al. 
Am J. Pathol 147: 1649-1660 (1995)). An interval of 10 days is allowed for post-operative 
recovery of rabbits and development of endogenous collateral vessels. At 10 day post- 
operatively (day 0), after performing a baseline angiogram, the internal iliac artery of the 
ischemic limb is transfected with 500 mg naked expression plasmid containing a 
polynucleotide of the invention by arterial gene transfer technology using a hydrogel- 
coated balloon catheter as described (Riessen et al. Hum Gene Ther. 4:749-758 (1993); 
Leclerc et al. J. Clin. Invest. 90: 936-944 (1992)). When a polypeptide of the invention is 
used in the treatment, a single bolus of 500 mg polypeptide of the invention or control is 
delivered into the internal iliac artery of the ischemic limb over a period of 1 min. through 
an infusion catheter. On day 30, various parameters are measured in these rabbits: (a) BP 
ratio - The blood pressure ratio of systolic pressure of the ischemic limb to that of normal 
limb; (b) Blood Flow and Flow Reserve - Resting FL: the blood flow during undilated 
condition and Max FL: the blood flow during fully dilated condition (also an indirect 
measure of the blood vessel amount) and Flow Reserve is reflected by the ratio of max FL: 
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resting FL; (c) Angiographic Score - This is measured by the angiogram of collateral 
vessels. A score is determined by the percentage of circles in an overlaying grid that with 
crossing opacified arteries divided by the total number m the rabbit thigh; (d) Capillary 
density - The number of collateral capillaries determined in light microscopic sections 
taken from hindlimbs. 

The studies described in this example tested activity of polynucleotides and 
polypeptides Of the invention. However, one skilled in the art could easily modify the 
exemplified studies to test the agonists, and/or antagonists of the invention. 

Example 4& Effect of Polypeptides of the Inve n tion on V^ l^n 

Since dilation of vascular endothelium is important in reducing blood pressure, the 
ability of polypeptides of the invention to affect the blood pressure in spontaneously 
hypertensive rats (SHR) is examined. Increasing doses (0, 10, 30, 100, 300, and 900 
mg/kg) of the polypeptides of the invention are administered to 13-14 week old 
spontaneously hypertensive rats (SHR). Data are expressed as the mean +/- SEM. 
Statistical analysis are performed with a paired t-test and statistical significance is defined 
as p<0.05 vs. the response to buffer alone. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 46: Rat Isrfr? mir Skin Flap M 0 H f I 

The evaluation parameters include skin blood flow, skin temperature, and factor 
VIII immunohistochemistry or endothelial alkaline phosphatase reaction. Expression of 
polypeptides of the invention, during the skin ischemia, is studied using in situ 
hybridization. 

The study in this model is divided into three parts as follows: 
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a) Ischemic skin 

b) Ischemic skin wounds 

c) Normal wounds 

The experimental protocol includes: 

a) Raising a 3x4 cm, single pedicle full-thickness random skin flap (myocutaneous 
flap over the lower back of the animal). 

b) An excisional wounding (4-6 mm in diameter) in the ischemic skin (skin-flap). 

c) Topical treatment with a polypeptide of the invention of the excisional wounds 
(day 0, 1, 2, 3, 4 post-wounding) at the following various dosage ranges: lmg to 100 mg. 

d) Harvesting the wound tissues at day 3, 5, 7, 10, 14 and 21 post-wounding for 
histological, immunohistochemical, and in situ studies. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 47: Peripheral Arterial Ok^co M»H 0 | 

Angiogenic therapy using a polypeptide of the invention is a novel therapeutic 
strategy to obtain restoration of blood flow around the ischemia in case of peripheral 
arterial diseases. The experimental protocol includes: 

a) One side of the femoral artery is ligated to create ischemic muscle of 
the hindlimb, the other side of hindlimb serves as a control. 

b) a polypeptide of the invention, in a dosage range of 20 mg - 500 mg, is delivered 
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-3 weeks. 

c) The ischemic muscle tissue is collected after ligation of the femoral 

artery at 1 , 2, and 3 weeks for the analysis of expression of a polypeptide of the invention 
and histology. Biopsy is also performed on the other side of normal muscle of the 
contralateral hindlimb. 
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The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 48: Ischemic Myocardial n jsease MnH-l 

A polypeptide of the invention is evaluated as a potent mitogen capable of 
stimulating the development of collateral vessels, and restructuring new vessels after 
coronary artery occlusion. Alteration of expression of the polypeptide is investigated in 
situ. The experimental protocol includes: 

a) The heart is exposed through a left-side thoracotomy in the rat. Immediately, the 
left coronary artery is occluded with a thin suture (6-0) and the thorax is closed. 

b) a polypeptide of the invention, in a dosage range of 20 mg - 500 mg, is delivered 
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-4 weeks. 

c) Thirty days after the surgery, the heart is removed and cross-sectioned 
for morphometric and in situ analyzes. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 49; Rat Cornel Wound M^i 



This animal model shows the effect of a polypeptide of the invention on 
neovascularization. The experimental protocol includes: 

a) Making a 1-1.5 mm long incision from the center of cornea into the stromal 

layer. 
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b) Inserting a spatula below the lip of the incision facing the outer corner of 
the eye. 

c) Making a pocket (its base is 1 - 1 .5 mm form the edge of the eye). 

d) Positioning a pellet, containing 50ng- 5ug of a polypeptide of the invention, 
within the pocket. 

e) Treatment with a polypeptide of the invention can also be applied topically 
to the corneal wounds in a dosage range of 20mg - 500mg (daily treatment for five days). 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 50; Diabetic Mouse and fil..™rm-H coiri.lmp a irPrt Wound H^linp p inA^c 
A. Diabetic db+/db+ Mouse Model. 

To demonstrate that a polypeptide of the invention accelerates the healing process, 
the genetically diabetic mouse model of wound healing is used. The full thickness wound 
healing model in the db+/db+ mouse is a well characterized, clinically relevant and 
reproducible model of impaired wound healing. Healing of the diabetic wound is 
dependent on formation of granulation tissue and re-epithelialization rather than 
contraction (Gartner, M.H. et al., J. Surg. Res. 52:389 (1992); Greenhalgh, D.G. et al., Am. 
J. Pathol. 136:1235 (1990)). 

The diabetic animals have many of the characteristic features observed in Type II 
diabetes mellitus. Homozygous (db+/db+) mice are obese in comparison to their normal 
heterozygous (db+/+m) littermates. Mutant diabetic (db+/db+) mice have a single 
autosomal recessive mutation on chromosome 4 (db+) (Coleman et al. Proc. Natl. Acad. 
Sci. USA 77:283-293 (1982)). Animals show polyphagia, polydipsia and polyuria. Mutant 
diabetic mice (db+/db+) have elevated blood glucose, increased or normal insulin levels, 
and suppressed cell-mediated immunity (Mandel et al, J. Immunol. 720:1375 (1978); 
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Debray-Sachs, M. et al, Clin. Exp. Immunol. 51(l):l-l (1983); Leiter et al.,Am. J. of 
Pathol. 114:46-55 (1985)). Peripheral neuropathy, myocardial complications, and 
microvascular lesions, basement membrane thickening and glomerular filtration 
abnormalities have been described in these animals (Norido, F. et al., Exp. Neurol. 
5 83(2):22l-232 (1984); Robertson et al., Diabetes 29(1):60-61 (1980); Giacomelli et al, 
Lab Invest. 40(4)A60A13 (1979); Coleman, D.L., Diabetes 31 (Suppl):\-6 (1982)). These 
homozygous diabetic mice develop hyperglycemia that is resistant to insulin analogous to 
human type II diabetes (Mandel etal.J. Immunol. 720:1375-1377 (1978)). 

The characteristics observed in these animals suggests that healing in this model 
10 may be similar to the healing observed in human diabetes (Greenhalgh, et al.. Am. J. of 
Pathol. 136: 1235-1246 (1990)). 

Genetically diabetic female C57BL/KsJ (db+/db + ) mice and their non-diabetic 
(db+/+m) heterozygous littermates are used in this study (Jackson Laboratories). The 
animals are purchased at 6 weeks of age and are 8 weeks old at the beginning of the study. 
15 Animals are individually housed and received food and water ad libitum. All 
manipulations are performed using aseptic techniques. The experiments are conducted 
according to the rules and guidelines of Human Genome Sciences, Inc. Institutional 
Animal Care and Use Committee and the Guidelines for the Care and Use of Laboratory 
Animals. 

20 Wounding protocol is performed according to previously reported methods 

(Tsuboi, R. and Rifkin, D.B., J. Exp. Med. 172:245-251 (1990)). Briefly, on the day of 
wounding, animals are anesthetized with an intraperitoneal injection of Avertin (0.01 
mg/mL), 2,2,2-tribromoethanol and 2-methyl-2-butanol dissolved in deionized water. The 
dorsal region of the animal is shaved and the skin washed with 70% ethane! solution and 

25 iodine. The surgical area is dried with sterile gauze prior to wounding. An 8 mm full- 
thickness wound is then created using a Keyes tissue punch. Immediately following 
wounding, the surrounding skin is gently stretched to eliminate wound expansion. The 
wounds are left open for the duration of the experiment. Application of the treatment is 
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given topically for 5 consecutive days commencing on the day of wounding. Prior to 
treatment, wounds are gently cleansed with sterile saline and gauze sponges. 

Wounds are visually examined and photographed at a fixed distance at the day of 
surgery and at two day intervals thereafter. Wound closure is determined by daily 
measurement on days 1-5 and on day 8. Wounds are measured horizontally and vertically 
using a calibrated Jameson caliper. Wounds are considered healed if granulation tissue is 
no longer visible and the wound is covered by a continuous epithelium. 

A polypeptide of the invention is administered using at a range different doses, 
from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle control groups 
received 50mL of vehicle solution. 

Animals are euthanized on day 8 with an intraperitoneal injection of sodium 
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for 
histology and immunohistochemistry. Tissue specimens are placed in 10% neutral 
buffered formalin in tissue cassettes between biopsy sponges for further processing. 

Three groups of 10 animals each (5 diabetic and 5 non-diabetic controls) are 
evaluated: 1) Vehicle placebo control, 2) untreated group, and 3) treated group. 

Wound closure is analyzed by measuring the area in the vertical and horizontal axis 
and obtaining the total square area of the wound. Contraction is then estimated by 
establishing the differences between the initial wound area (day 0) and that of post 
treatment (day 8). The wound area on day 1 is 64mm 2 , the corresponding size of the 
dermal punch. Calculations are made using the following formula: 

[Open area on day 8] - [Open area on day 1] / [Open area on day 1] 



are 



Specimens are fixed in 10% buffered formalin and paraffin embedded blocks 
sectioned perpendicular to the wound surface (5mm) and cut using a Reichert-Jung 
microtome. Routine hematoxylin-eosin (H&E) staining is performed on cross-sections of 
bisected wounds. Histologic examination of the wounds are used to assess whether the 
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healing process and the morphologic appearance of the repaired skin is altered by 
treatment with a polypeptide of the invention. This assessment included verification of the 
presence of cell accumulation, inflammatory cells, capillaries, fibroblasts, re- 
epithelialization and epidermal maturity (Greenhalgh, D.G. et al., Am. J. Pathol. 136:1235 
(1990)). A calibrated lens micrometer is used by a blinded observer. 

Tissue sections are also stained immunohistochemically with a polyclonal rabbit 
anti-human keratin antibody using ABC Elite detection system. Human skin is used as a 
positive tissue control while non-immune IgG is used as a negative control. Keratinocyte 
growth is determined by evaluating the extent of reepithelialization of the wound using a 
calibrated lens micrometer. 

Proliferating cell nuclear antigen/cyclin (PCNA) in skin specimens is demonstrated 
by using anti-PCNA antibody (1:50) with an ABC Elite detection system. Human colon 
cancer can serve as a positive tissue control and human brain tissue can be used as a 
negative tissue control. Each specimen includes a section with omission of the primary 
antibody and substitution with non-immune mouse IgG. Ranking of these sections is based 
on the extent of proliferation on a scale of 0-8, the lower side of the scale reflecting slight 
proliferation to the higher side reflecting intense proliferation. 

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is 
considered significant. 

B. Steroid Impaired Rat Model 

The inhibition of wound healing by steroids has been well documented in various 
in vitro and in vivo systems (Wahl, Glucocorticoids and Wound healing. In: Anti- 
Inflammatory Steroid Action: Basic and Clinical Aspects. 280-302 (1989); Wahle/a/., J. 
Immunol. 115: 476-481 (1975); Werb et al, J. Exp. Med. 747:1684-1694 (1978)). 
Glucocorticoids retard wound healing by inhibiting angiogenesis, decreasing vascular 
permeability (Ebert et al., An. Intern. Med. 57:701-705 (1952)), fibroblast proliferation, 
and collagen synthesis (Becked/., Growth Factors. 5:295-304(1991); Haynes et al., J. 
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Clin. Invest. 61: 703-797 (1978)) and producing a transient reduction of circulating 
monocytes (Haynes et al., J. Clin. Invest. 61: 703-797 (1978); Wahl, "Glucocorticoids and 
wound healing", In: Antiinflammatory Steroid Action: Basic and Clinical Aspects, 
Academic Press, New York, pp. 280-302 (1989)). The systemic administration of steroids 
to impaired wound healing is a well establish phenomenon in rats (Beck et al, Growth 
Factors. 5: 295-304 (1991); Haynes et al, J. Clin. Invest. 61: 703-797 (1978); Wahl, 
"Glucocorticoids and wound healing", In: Antiinflammatory Steroid Action: Basic and 
Clinical Aspects, Academic Press, New York, pp. 280-302 (1989); Pierce et al, Proc. 
Natl Acad. ScL USA 86: 2229-2233 (1989)). 

To demonstrate that a polypeptide of the invention can accelerate the healing 
process, the effects of multiple topical applications of the polypeptide on full thickness 
excisional skin wounds in rats in which healing has been impaired by the systemic 
administration of methylprednisolone is assessed. 

Young adult male Sprague Dawley rats weighing 250-300 g (Charles River 
Laboratories) are used in this example. The animals are purchased at 8 weeks of age and 
are 9 weeks old at the beginning of the study. The healing response of rats is impaired by 
the systemic administration of methylprednisolone (17mg/kg/rat intramuscularly) at the 
time of wounding. Animals are individually housed and received food and water ad 
libitum. All manipulations are performed using aseptic techniques. This study is 
conducted according to the rules and guidelines of Human Genome Sciences, Inc. 
Institutional Animal Care and Use Committee and the Guidelines for the Care and Use of 
Laboratory Animals. 

The wounding protocol is followed according to section A, above. On the day of 
wounding, animals are anesthetized with an intramuscular injection of ketamine (50 
mg/kg) and xylazine (5 mg/kg). The dorsal region of the animal is shaved and the skin 
washed with 70% ethanol and iodine solutions. The surgical area is dried with sterile 
gauze prior to wounding. An 8 mm full-thickness wound is created using a Keyes tissue 
punch. The wounds are left open for the duration of the experiment. Applications of the 
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testing materials are given topically once a day for 7 consecutive days commencing on the 
day of wounding and subsequent to methylprednisolone administration. Prior to treatment, 
wounds are gently cleansed with sterile saline and gauze sponges. 

Wounds are visually examined and photographed at a fixed distance at the day of 
wounding and at the end of treatment. Wound closure is determined by daily measurement 
on days 1-5 and on day 8. Wounds are measured horizontally and vertically using a 
calibrated Jameson caliper. Wounds are considered healed if granulation tissue is no longer 
visible and the wound is covered by a continuous epithelium. 

The polypeptide of the invention is administered using at a range different doses, 
from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle control groups 
received 50mL of vehicle solution. 

Animals are euthanized on day 8 with an intraperitoneal injection of sodium 
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for 
histology. Tissue specimens are placed in 10% neutral buffered formalin in tissue cassettes 
between biopsy sponges for further processing. 

Four groups of 10 animals each (5 with methylprednisolone and 5 without 
glucocorticoid) are evaluated: 1) Untreated group 2) Vehicle placebo control 3) treated 
groups. 

Wound closure is analyzed by measuring the area in the vertical and horizontal axis 
and obtaining the total area of the wound. Closure is then estimated by establishing the 
differences between the initial wound area (day 0) and that of post treatment (day 8). The 
wound area on day 1 is 64mm 2 , the corresponding size of the dermal punch. Calculations 
are made using the following formula: 

(Open area on day 8] - [Open area on day 1) / [Open area on day 1] 

Specimens are fixed in 10% buffered formalin and paraffin embedded blocks are 
sectioned perpendicular to the wound surface (5mm) and cut using an Olympus 
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microtome. Routine hematoxylin-eosin (H&E) staining is performed on cross-sections of 
bisected wounds. Histologic examination of the wounds allows assessment of whether the 
healing process and the morphologic appearance of the repaired skin is improved by 
treatment with a polypeptide of the invention. A calibrated lens micrometer is used by a 
blinded observer to determine the distance of the wound gap. 

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is 
considered significant. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 51: Lvmpha dema Animal Model 

or The purpose of this experimental approach is to create an appropriate and 
consistent lymphedema model for testing the therapeutic effects of a polypeptide of the 
invention in lymphangiogenesis and re-establishment of the lymphatic circulatory system 
in the rat hind limb. Effectiveness is measured by swelling volume of the affected limb, 
quantification of the amount of lymphatic vasculature, total blood plasma protein, and 
histopathology. Acute lymphedema is observed for 7-10 days. Perhaps more importantly, 
the chronic progress of the edema is followed for up to 3-4 weeks. 

Prior to beginning surgery, blood sample is drawn for protein concentration 
analysis. Male rats weighing approximately ~350g are dosed with Pentobarbital. 
Subsequently, the right legs are shaved from knee to hip. The shaved area is swabbed with 
gauze soaked in 70% EtOH. Blood is drawn for serum total protein testing. 
Circumference and volumetric measurements are made prior to injecting dye into paws 
after marking 2 measurement levels (0.5 cm above heel, at mid-pt of dorsal paw). The 
intradermal dorsum of both right and left paws are injected with 0.05 ml of 1% Evan's 
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Blue. Circumference and volumetric measurements are then made following injection of 
dye into paws. 

Using the knee joint as a landmark, a mid-leg inguinal incision is made 
circumferentially allowing the femoral vessels to be located. Forceps and hemostats are 
5 used to dissect and separate the skin flaps. After locating the femoral vessels, the 
lymphatic vessel that runs along side and underneath the vessel(s) is located. The main 
lymphatic vessels in this area are then electrically coagulated suture ligated. 

Using a microscope, muscles in back of the leg (near the semitendinosis and 
adductors) are bluntly dissected. The popliteal lymph node is then located. The 2 
10 proximal and 2 distal lymphatic vessels and distal blood supply of the popliteal node are 
then and ligated by suturing. The popliteal lymph node, and any accompanying adipose 
tissue, is then removed by cutting connective tissues. 

Care is taken to control any mild bleeding resulting from this procedure. After 
lymphatics are occluded, the skin flaps are sealed by using liquid skin (Vetbond) (AJ 
15 Buck). The separated skin edges are sealed to the underlying muscle tissue while leaving a 
gap of -0.5 cm around the leg. Skin also may be anchored by suturing to underlying 
muscle when necessary. 

To avoid infection, animals are housed individually with mesh (no bedding). 
Recovering animals are checked daily through the optimal edematous peak, which 

20 typically occurred by day 5-7. The plateau edematous peak are then observed. To 
evaluate the intensity of the lymphedema, the circumference and volumes of 2 designated 
places on each paw before operation and daily for 7 days are measured. The effect plasma 
proteins on lymphedema is determined and whether protein analysis is a useful testing 
perimeter is also investigated. The weights of both control and edematous limbs are 

25 evaluated at 2 places. Analysis is performed in a blind manner. 

Circumference Measurements: Under brief gas anesthetic to prevent limb 
movement, a cloth tape is used to measure limb circumference. Measurements are done at 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

557 

the ankle bone and dorsal paw by 2 different people then those 2 readings are averaged. 
Readings are taken from both control and edematous limbs. 

Volumetric Measurements: On the day of surgery, animals are anesthetized with 
Pentobarbital and are tested prior to surgery. For daily volumetrics animals are under brief 
halothane anesthetic (rapid immobilization and quick recovery), both legs are shaved and 
equally marked using waterproof marker on legs. Legs are first dipped in water, then 
dipped into instrument to each marked level then measured by Buxco edema 
software(Chen/Victor). Data is recorded by one person, while the other is dipping the limb 
to marked area. 

Blood-plasma protein measurements: Blood is drawn, spun, and serum separated 
prior to surgery and then at conclusion for total protein and Ca2+ comparison. 

Limb Weight Comparison: After drawing blood, the animal is prepared for tissue 
collection. The limbs are amputated using a quillitine, then both experimental and control 
legs are cut at the ligature and weighed. A second weighing is done as the tibio-cacaneal 
joint is disarticulated and the foot is weighed. 

Histological Preparations: The transverse muscle located behind the knee 
(popliteal) area is dissected and arranged in a metal mold, filled with freezeGel, dipped 
into cold methylbutane, placed into labeled sample bags at - 80EC until sectioning. Upon 
sectioning, the muscle is observed under fluorescent microscopy for lymphatics.. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 



Example 52: Suppression o f TNF alph a-in duce d adhesinn mnl m ilc» exp ression h y a 
Polypeptide f the Inventio n 

The recruitment of lymphocytes to areas of inflammation and angiogenesis 
involves specific receptor- 1 igand interactions between cell surface adhesion molecules 
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(CAMs) on lymphocytes and the vascular endothelium. The adhesion process, in both 
normal and pathological settings, follows a multi-step cascade that involves intercellular 
adhesion molecule-1 (ICAM-1), vascular cell adhesion molecule-l (VCAM-1), and 
endothelial leukocyte adhesion molecule-1 (E-selectin) expression on endothelial cells 
5 (EC). The expression of these molecules and others on the vascular endothelium 
determines the efficiency with which leukocytes may adhere to the local vasculature and 
extravasate into the local tissue during the development of an inflammatory response. The 
local concentration of cytokines and growth factor participate in the modulation of the 
expression of these CAMs. 

10 Tumor necrosis factor alpha (TNF-a), a potent proinflammatory cytokine, is a 

stimulator of all three CAMs on endothelial cells and may be involved in a wide variety of 
inflammatory responses, often resulting in a pathological outcome. 

The potential of a polypeptide of the invention to mediate a suppression of TNF-a 
induced CAM expression can be examined. A modified ELISA assay which uses ECs as a 
5 solid phase absorbent is employed to measure the amount of CAM expression on TNF-a 
treated ECs when co-stimulated with a member of the FGF family of proteins. 

To perform the experiment, human umbilical vein endothelial cell (HUVEC) 
cultures are obtained from pooled cord harvests and maintained in growth medium (EGM- 
2; Clonetics, San Diego, CA) supplemented with 10% FCS and 1% penicillin/streptomycin 
in a 37 degree C humidified incubator containing 5% C0 2 . HUVECs are seeded in 96- 
well plates at concentrations of 1 x 104 cells/well in EGM medium at 37 degree C for 18- 
24 hrs or until confluent. The monolayers are subsequently washed 3 times with a serum- 
free solution of RPMI-1640 supplemented with 100 U/ml penicillin and 100 mg/ml 
streptomycin, and treated with a given cytokine and/or growth factor(s) for 24 h at 37 
degree C. Following incubation, the cells are then evaluated for CAM expression. 

Human Umbilical Vein Endothelial cells (HUVECs) are grown in a standard 96 
well plate to confluence. Growth medium is removed from the cells and replaced with 90 
ul of 199 Medium (.0% FBS). Samples for testing and positive or negative controls are 
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added to the plate in triplicate (in 10 ul volumes). Plates are incubated at 37 degree C for 
either 5 h (selectin and integrin expression) or 24 h (integrin expression only). Plates are 
aspirated to remove medium and 100 fi\ of 0.1% paraformaldehyde-PBS(with Ca++ and 
Mg++) is added to each well. Plates are held at 4°C for 30 min. 

5 Fixative is then removed from the wells and wells are washed IX with 

PBS(+Ca,Mg)+0.5% BSA and drained. Do not allow the wells to dry. Add 10 /xl of 
diluted primary antibody to the test and control wells. Anti-ICAM-l-Biotin, Anti-VCAM- 
1-Biotin and Anti-E-selectin-Biotin are used at a concentration of 10 fig/ml (1:10 dilution 
of 0.1 mg/ml stock antibody). Cells are incubated at 37°C for 30 min. in a humidified 

10 environment. Wells are washed X3 with PBS(+Ca,Mg)+0.5% BSA. 

Then add 20 fi\ of diluted ExtrAvidin-Alkaline Phosphotase (1:5,000 dilution) to 
each well and incubated at 37°C for 30 min. Wells are washed X3 with 
PBS(+Ca,Mg)+0.5% BSA. 1 tablet of p-Nitrophenol Phosphate pNPP is dissolved in 5 ml 
of glycine buffer (pH 10.4). 100 /il of pNPP substrate in glycine buffer is added to each 

15 test well. Standard wells in triplicate are prepared from the working dilution of the 
ExtrAvidin-Alkaline Phosphotase in glycine buffer: 1:5,000 (10°) > > 10 1 > 10 l 5 . 5 
/xl of each dilution is added to triplicate wells and the resulting AP content in each well is 
5.50 ng, 1.74 ng, 0.55 ng, 0.18 ng. 100 jil of pNNP reagent must then be added to each of 
the standard wells. The plate must be incubated at 37°C for 4h. A volume of 50 fi\ of 3M 

20 NaOH is added to all wells. The results are quantified on a plate reader at 405 nm. The 
background subtraction option is used on blank wells filled with glycine buffer only. The 
template is set up to indicate the concentration of AP-conjugate in each standard well [ 
5.50 ng; 1.74 ng; 0.55 ng; 0.18 ng]. Results are indicated as amount of bound AP- 
conjugate in each sample. 

25 The studies described in this example tested activity of a polypeptide of the 

invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 
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It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other disclosures) in 
the Background of the Invention, Detailed Description, and Examples is hereby 
incorporated herein by reference. Further, the hard copy of the sequence listing submitted 
herewith and the corresponding computer readable form are both incorporated herein by 
reference in their entireties. 

Exampk 53: Cloninp, sequence analysis and rhr 0 mosftnia | l»p g |; Tation nf , hp nftw , 
human inteprin alpha 1 1 suhiinj f 

Abstract 

The integrins are a large family of cell adhesion molecules consisting of noncovalently 
associated 00 heterodimers. We have cloned and sequenced a novel human integrin a - 
subunit cDNA, designated all. The all cDNA encodes a protein with a 22 amino acid 
signal peptide, a large 1120 residue extracellular domain that contains an I-domain of 207 
residues and is linked by a transmembrane domain to a short cytoplasmic domain of 24 
amino acids. The deduced all protein shows the typical structural features of integrin a- 
subunits and is similar to a distinct group of a-subunits from collagen-binding integrins. 
However, it differs from most integrin a-chains by an incompletetely preserved 
cytoplasmic GFFKR motif. The human ITGAU gene was located to bands q22.3-23 on 
chromosome 15, and its transcripts were found predominantly in bone, cartilage as well as 
in cardiac and skeletal muscle. Expression of the 5.5 kilobase all mRNA was also 
detectable in ovary and small intestine. 
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Introduction 

All vertebrate cells express members of the integrin family of cell adhesion molecules, 
which mediate cellular adhesion to other cells and extracellular subtratum, cell migration 
and participate in important physiologic processes from signal transduction to cell 
5 proliferation and differentiation {Hynes, 92; Springer, 92}. Integrins are structurally 
homologous heterodimeric type-I membrane glycoproteins formed by the noncovalent 
association of one of eight p -subunits with one of the 17 different cc-subunits described to 
date, resulting in at least 22 different ap 4 complexes. Their binding specificities for cellular 
and extracellular ligands are determined by both subunits and are dynamically regulated in 

10 a cell-type-specific mode by the cellular environment as well as by the developmental and 
activation state of the cell {Diamond and Springer, 94}. In integrin a -subunits, the 
aminoterminal region of the large extracellular domain consists of a seven-fold repeated 
structure which is predicted to fold into a P -propeller domain {Corbi et ai, 1987; 
Springer, 1997}. The three or four C-terminal repeats contain putative divalent cation 

15 binding motifs that are thought to be important for ligand binding and subunit association 
{Diamond and Springer, 94}. The a 1 , a 2 , a '°, a D , a E , a L , a M and cc x -subunits contain an 
approximately 200 amino acid I-domain inserted between the second and third repeat that 
is not present in other a-chains {Larson et ai, 1989}. Several isolated I-domains have been 
shown to independently bind the ligands of the parent integrin heterodimer {Kamata and 

20 Takada, 1994; Randi and Hogg, 1994}. The <r\ a 58 , a"" and a v -subunits are 
proteolytically processed at a conserved site into disulphide-linked heavy and light chains, 
while the a 4 -subunit is cleaved at a more aminoterminal site into two fragments that 
remain noncovalently associated {Hemler et ai, 90}. Additional cc-subunit variants are 
generated by alternative splicing of primary transcripts {Ziober et ai, 93; Delwel et ai, 95; 

25 Leung et ai, 98}. The extracellular domains of cc-integrin subunits are connected by a 
single spanning transmembrane domain to short, diverse cytoplasmic domains whose only 
conserved feature is a membrane-proximal KXGFF(K/R)R motif {Sastry and Horwitz, 
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1993}. The cytoplasmic domains have been implicated in the cell-type-specific modulation 
of integrin affinity states {Williams et al y 1994}. 

Here we report the cDNA cloning, sequence analysis, expression and chromosomal 
5 localization of the human cc-integrin subunit. 

Materials and Methods 

Library screening and DNA sequencing. 

A human fetal heart cDNA library in XgtlO (Clontech Laboratories, Inc., Palo Alto, CA, 
10 USA) was screened with ?2 P-labelled (retf/prime, Amersham New Zealand Ltd., Auckland, 
New Zealand) probes corresponding to the regions 473 to 749 and 2394 to 3189 of the al 1 
cDNA using standard procedures. Inserts were subcloned from XgtlO into pUC21 and 
sequenced on both strands according to a successive specific primer strategy on an 
automated sequencer (Applied Biosystems 373A, The Centre for Gene Technology, School 
15 of Biological Sciences, The University of Auckland). 

Northern Blot Analysis and Tissue distribution. 

A 134Ibp PCR fragment corresponding to the region 351-1692 of the a cDNA was 32 P- 
labelled (raflprime) and hybridized with human multiple tissue Northern blots (MTN I and 

20 MTN II, Clontech) for 16h at 60<>°C in ExpressHyb solution (Clontech). Filters were 
washed twice with O.IxSSC/i% SDS at 50°oC for 30min, and autoradiographed. Human 
DNA from 63 tissue-specific cDNA libraries (Express-Check™, American Type Culture 
Collection, Manassas, VA, USA) was amplified using primers KL120 (5*- 
GCAGGGATGCCACCTGCC) and KL119 (S'-GATGAAGACTGTGGTGTCGAAGG) 

25 according to the manufacturers instructions. PCR-products were resolved by agarose gel 
electrophoresis and transferred to Hybond C+ (Amersham). Filters were hybridized by 
standard procedures {Ausubel et al, 98} with a 502bp 32 P-Iabelled (rediprime) probe 
fragment obtained from the cloned " M cDNA with the same oligonucleotides. 
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Chromosomal assignment. 

500ng genomic DNA prepared from a panel of 21 human-rodent somatic cell hybrids or 
from human, mouse and hamster cells {Kelsell et ai, 95} was amplified with 
oligonuleotides KL175 (S'-GGTGCCAGACCTACATGGAC) and KL189 (5'- 
CGTGCAAATTCAATGCCAAATGCC) in a standard PCR reaction of 30 cycles (94~C 
for lmin, 55~C for lmin, 72~C for 2min). All PCR reactions were resolved in a 2% 
agarose gel. Southern hybridization was performed as detailed above, except that the probe 
fragment was obtained from clone HOHBY69 with oligonucleotides KL175 and KL189. 
For fluorescent in situ hybridization, metaphase spreads were prepared from 
phytohemagglutinin-stimulated peripheral blood lymphocytes of a 46,XY male donor 
using standard cytogenetic procedures. A purified 3.7kB fragment representing the entire 
coding region of clone HOHBY69 was labelled with biotin-16-dUTP using the High Prime 
labelling kit (Roche Molecular Biochemicals, Auckland, NZ). Conditions for hybridization 
and immunofluorescent detection were essentially as described {Morris et ai, 93), except 
that C 0 t-1 suppression was not required, slides were washed to a stringency of 
0,lxSSC/60ooC after hybridization, and an additional amplification step was needed 
because of the small size of the probe. For precise chromosome band localization, DAPI 
and FITC images were captured using a Photometries KAF1400 CCD camera and QUIPS 
Smartcapture FISH software version 1.3 (Vysis Inc., Downers Grove, IL, USA). QUIPS 
CGH/Karyotyping software (version 3.0.2) assisted karyotype analysis. 

Results 

Cloning of a novel human a-integrin subunit cDNA: 

A protein homology search {Altschul et ai, 90} of the human expressed sequence tag 
(EST) databases of Human Genome Sciences, Inc. {Ni etai, 97} and The Institute for 
Genomic Research {Kirkness and Keriavage, 97} identified the clones HRDAF83 and 
HOEAM34 as candidate novel integrin a -subunit cDNAs. Clone HRDAF83 was isolated 
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from a human rhabdomyosarcoma cDNA library and sequenced on both strands. The 
1223bp insert contains largely incompletely processed hnRNA and a 277bp region that 
showed homology to the aminoterminal half of the al-integrin I-domain . The 2517bp 
insert of clone HOEAM34 was derived from a human osteoblast cDNA library. It is 
homologous to the C-terminal part of the human ccl-subunit and contains 1324 nucleotides 
of 3'-untranslated region . In order to isolate the full-length cDNAs for these integrin a- 
subunits, a cDNA library prepared from human fetal heart in A.gtlO was screened with the 
277bp fragment from clone HRDAF83 homologous to the ccl-I-domain. Two clones, X831 
and A.832, were isolated and both strands of their inserts sequenced. Clone A.832 contains 
the entire 5' half of a novel a-subunit cDNA, while clone X831 covers the same region, but 
is 358bp and 173bp shorter than X831 at its 5'- and 3'-ends, respectively. A screening of the 
same library with a 795bp fragment from the extreme 5'-terminus of clone HOEAM34 
identified clone X342, which contained essentially the same region as clone HOEAM34 
but has a 317bp shorter S'-untranslated region . Rescreening the EST databases with the 
sequences derived from the human fetal heart library led to the identification of clone 
HOHBY69, which was isolated from a osteoblast cDNA libray. Both strands of the 4681bp 
insert of clone HOHBY69 were sequenced. The S'-region of HOHBY69 was identical to 
the HRDAF83/X832/X831 -group, while the 3'-region of HOHBY69 was largely identical 
to HOEAM34 and A.342, thereby demonstrating that the two groups of partial cDNAs 
represent the 5'- and S'-portions of the same cDNA . One major difference between the 
HOHBY69 and HOEAM34/X342 is the presence of an additional GTA-triplet at position 
3088 in HOHBY69. From the overlapping clones, a total of 4986bp of cDNA was 
assembled to the composite sequence shown in Figure 2 and has been submitted to 
GenBank™ with accession number AF109681. This cDNA encodes a previously 
unidentifed human integrin a-subunit that was designated all. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/29435 PCT/US99/25031 

565 

Structure of the human all-subunit. 

The all cDNA contains a 5-untranslated region of 72 nucleotides and a single open 
reading frame extending from a predicted translation initiation codon at position +1 to a 
TGA termination codon at position 3570 . This is followed by 1324 nucleotides of 3" 
untranslated region which contains an AATTAAA polyadenylation signal {Wahle and 
Keller, 1996} 12 nucleotides upstream of a poly(A) stretch. The deduced amino acid 
sequence contains a 22 residue N-terminal region with the characteristics of a cleaved 
signal peptide {von Heijne, 83; Nielsen et al, 97}, a large extracellular domain of 1120 
amino acids followed by a 23 amino acid hydrophobic stretch that resembles a 
transmembrane domain, and a short 24 residue cytoplasmic domain. The molecular weight 
of the mature 1 167 amino acid al 1-subunit is predicted to be 13 1 kDa, but the addition of 
carbohydrate side chains to any of the 15 potential N-glycosylation sequons [NX(S/T)] 
within the extracellular domain is likely to increase the molecular weight of the native 
protein. An I-domain of 207 amino acids is inserted between the second and third repeat . 
Consistent with the structure of an typical I-domain-containing integrin a-subunit, it lacks 
a potential dibasic protease cleavage site in the C-terminal region of the extracellular 
domain. 

The al l-subunit is most closely related to the recently discovered alO-subunit (Camper et 
al, 98, Lehnert et al, in preparation} and the al- and a2-subunits. Overall, the mature 
all-protein is 45% identical to the alO chain, while the homologies to the al- and a2- 
subunits are 41% and 39%, respectively. Even greater homology exists between the I- 
domains of the alO- and all-subunits which are 60% identical to each other. The high 
degree of homology seen in the extracellular domains of the subunits is in contrast to the 
low similarity of their cytoplasmic domains. Interestingly, the KXGFF(K/R)R motif that is 
absolutely conserved in all other a-subunit cytoplasmic domains is only partially preserved 
in both subunits. The sequence in all is KLGFFRS, while the alO-subunit contains a 
KLGFFAH motif. A graphical comparison of the similarity between all integrin a-subunits 
is shown in Fig. 3. Together with the a-subunits from the collagen-binding integrins alpl, 
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a2pland alOpl, the all-subunit forms a group distinct from the other I-domain- 
containing integrin subunits. 



Tissue distribution and expression of the integrin al l-subunit. 

The tissue distribution of the all mRNA was assessed by screening multiple human tissue 
Northern blots with a probe corresponding to the region 351-1692 of the al 1 cDNA. A 
single transcript of approximately 5.5kb was found weakly expressed only in ovary and 
small intestine . Integrin all-subunit expression was further analyzed by amplification 
and Southern hybridization of a 502bp fragment corresponding to the region 1988-2490 
the all cDNA from tissue-specific human cDNA libraries, all cDNA was detected i 
five different cDNA libraries prepared from fetal heart (day 57-75), in two fetal brain 
libraries, and in a cDNA library from large intestine (not shown). An analysis of the 
Human Genome Sciences Database revealed eight different all -related ESTs in human 
osteoblast libraries, three EST in a human chondrosarcoma cDNA library and two EST in a 
human stromal osteoclastoma library. 



m 



in 



Chromosomal localization of the integrin all-subunit. 

Genomic DNA from a collection af 21 human-rodent somatic cell hybrids {Kelsell et al., 
95} was amplified by PCR using oligonucleotide primers directed the the region 473 to 
749 of the human all cDNA. In Southern hybridization, a signal corresponding to a l,4kb 
fragment was detectable only with DNA from a hybrid cell line that contains human 
chromosome 15. A fragment of the same size was also amplified from human genomic 
DNA, but not from mouse or hamster DNA (Fig. 5C). Cloning and sequence of the PCR 
product from chromosome 15 revealed the presence of a 1154bp intron inserted after 
cDNA-position 600, thus resulting in a PCR-product of 1431 bp. The ITGA1 1 gene was 
also localized by fluorescent in situ hybridization of metaphase chromosomes with the 
entire coding region from clone HOHBY69. All of 20 metaphase cells analyzed showed 
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fluorescent signal on both chromosomes 15, specifically across bands q22.3-q23. No 
additional signals were detected on any other chromosome (Fig. 5 A). 

Discussion: 

5 We have cloned and sequenced a novel cDNA encoding a protein that shares extensive 
structural homology with integrin a-chains. The aminoterminal 22 amino acids of the 
deduced protein sequence show the characteristic features of a hydrophobic leader peptide, 
including a signal peptidase recognition motif at positions -3 and -1 {von Heijne 83). 
Proteolytic cleavage of the precursor protein at this position would result in an 

10 aminoterminal sequence for the mature all-chain of FNMD, which is similar to the 
consensus sequence[(F/Y)M(L/V)D] of all other integrin ccl 1-subunits {Tuckwell et ai, 
94}. The N-terminal half of the large extracellular region of all is composed of seven 
repeats that each contain FG--GAP~GxxY consensus motifs (FG-GAP repeats). These 
repeats can be found in all integrin al-subunits and are predicted to fold into a p-propeller 

15 domain (Springer, 97}. Inserted between the second and third FG-GAP repeats is a 207 
amino acid I-domain spanning from glutamine" 8 to methionine" 14 . It contains a divalent 
cation coordination motif that has been shown to directly bind Mg 2+ ions in the a M subunit 
{Michishita et ai, 96}. The noncontiguous amino acid side chains involved in the 
coordination of magnesium or manganese ions have been identified by mutagenesis 

20 analysis and from crystal structures of the isolated a 2 , a L and a M -subunit I-domains 
{Emsley et ai, 97; Qu and Leahy, 95; Lee et ai, 95; }. All residues required for the 
coordination of the divalent cations in these subunits are preserved in the all-I-domain. 
These are the asparagines at positions 148 and 249, the serine residues at position 150 and 
152, and the threonine at position 218. 

25 The crystal sctructure of the a2-subunit has revealed a small "-helix that is not present in 
the I-domains of the 32-associated a-subunits. Together with the MIDAS sphere, amino 
acid residues from this C-helix and the adjacent turn region have been proposed to make 
physical contacts to a collagen triple helix {Emsley et ai, 97}. Interestingly, the small C- 
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helix is structurally conserved in the a-subunits of the collagen-binding integrins ocipi, 
Copland aiopi, and is also present in the cclll-domain (G^YYNR 283 ). In addition, 
asparagine' 54 and histidine 238 of the <x2-I-domain were predicted to contact the collagen 
triple helix, and both are preserved in the al, alO and all-I-domains, but not in other 
integrin al 1-subunits. The conservation of structural motifs required for collagen binding 
suggests that collagen may be a ligand for the al 1 integrin. Each of the repeats 5-7 of the 
all-subunit accomodates the sequence Dx(D/N)xDxxxD. Three or four copies of these 
putative divalent cation binding sites are conserved in all integrin a-subunits and their 
presence is consistent with the divalent cation requirement for the adhesive function of 
integrins {Larson et al, 89; Fujimura and Phillips, 83; Hynes, 92}. The extracellular 
domain of the integrin al 1-subunit contains 20 cysteine residues. Only the intramolecular 
disulfide bonds in the " llb subunit have been biochemically characterized {Calvete et al, 
89}, but the location of many cysteines is conserved in integrin a-subunits. In the all- 
subunit, the cysteine residues 637 and 646, 652 and 707, 759 and 765, and 859 and 871 are 
homologous to the residues that form the four carboxyterminal disulfide bonds in the 
heavy chain of "» {Calvete et al, 89}. Based on the proposed structure of the integrin a- 
subunit propeller domain {Springer et al., 97}, additional disulfide bonds within the all 
subunit can be predicted between cysteine residues 54 and 61, 99 and 117, and between 
107 and 137. Two additional cysteine residues are found within a short segment (residues 
783 to 798) that is unique to the al 1-subunit. 

The integrin cytoplasmic domains play central roles in integrin affinity modulation and in 
cellular signal transmission. The membrane-proximal sequence KxGFF(K/R)R is strictly 
conserved among integrin a-subunit cytoplasmic domains {Williams et al, 94}. Within 
this motif, both phenylalanine residues and the last arginine have been implicated in 
maintaining the default low affinity state of integrins a% and a"%, as their substitution 
or deletion resulted in constitutively activated ligand binding {O'Toole et al, 94, Lu and 
Spriner 97}. Interestingly, the last arginine residue is replaced by a serine in the al 1 
cytoplasmic domain and with a histidine in the alO subunit, suggesting that both integrins 
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might be in a default "high" affinity state. It will be interesting to analyze whether 
substitution of these residues with a conserved arginine will affect their affinity status. 
We have isolated all cDNAs from osteoclast, osteoblast, myosarcoma and fetal heart 
libraries. Amongst the HGS EST databases, integrin all transcripts were predominantly 
found in libraries prepared from osteoblast, osteoclast and chondrosarcoma cells. A search 
for further ""-related sequences in the EST division of the GenBank database revealed two 
clones (accession numbers Z50157 and Z50167) from primary human myoblasts {Genini 
et a/., 96}, two clones from human trabecular bone cells (AA852614 and AA852615), as 
well as clones from fibroblast cells (W45078), pancreatic tumor (U53091) and breast tissue 
(H16112). In contrast, Northern blot analysis detected all -expression only in ovary and 
small intestine, only fetal heart, fetal brain and large intestine. Of the tissues represented in 
the tissue-specific cDNA-library panel, only fetal heart, fetal brain and large intestine 
showed detectable all-expression. However, bone- and muscle-derived tissues were not 
included in the Northern blot, and cDNA libraries prepared from these tissues were also 
not represented in the tissue-specific cDNA panel. 

The /TGAiigene was localized to chromosome 15, bands q22.3-23, by FISH and 
PCR analysis of human-rodent somatic cell hybrids. This segment is overrepresented in 
squamous cell carcinomas {Wolff et aL, 1998}, but appears to only infrequently affected in 
other cancers. Genes at this region encode neogenin, a protein expressed ubiquitously 
expressed in human tissues {Meyerhardt et aL, 97}; tropomyosin 1, expressed in cardiac 
and skeletal muscle tissues {Tiso et al, 97}; and the human homologue of the 
metalloprotease-disintegrin kuzbanian, which is overexpressed in tumors of 
sympathoadrenal origin { Yavari et a/., 98}. In addition, the region 15q22.3-q23 is linked to 
Bardet-Biedl syndrome 4, a heterogeneous autosomal disorder characterized by obesity 
and associated with cardiovascular anomalities {Carmi et ai, 95 } . 

In conclusion, we have cloned and sequenced the cDNA for the novel integrin al 1-subunit 
which is closely related to the "-subunits of the collagen-binding integrins alpl, a2(Jl and 
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cclOpl. The high degree of homology of al 1 to these subunits suggests that it associates 
with the integrin 01 -subunit, and may function as an additional collagen receptor. 
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The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform 
the International Bureau accordingly before completion of technical preparations for 
publication of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert 
may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31 F(l) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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What Is Claimed Is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group consisting 
of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, having 
biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any one of 
the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize 
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only 
A residues or of only T residues. 



2. The isolated nucleic acid molecule of claim 1 , wherein the polynucleotide 
fragment comprises a nucleotide sequence encoding a secreted protein. 

3. The isolated nucleic acid molecule of claim 1 , wherein the polynucleotide 
fragment comprises a nucleotide sequence encoding the sequence identified as SEQ ID 



SUBSTITUTE SHEET (RULE 26) 



W0 00/29435 PCT/US99/25031 

579 

NO:Y or the polypeptide encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1 , wherein the polynucleotide 
fragment comprises the entire nucleotide sequence of SEQ ID NO:X or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated nucleic 
acid molecule of claim 1. 

9. A recombinant host cell produced by the method of claim 8. 

10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 
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(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in ATCC 
Deposit No:Z; 

(0 a full length protein of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(g) a variant of SEQ ID NO:Y; 

(h) an allelic variant of SEQ ID NO:Y; or 

(i) a species homologue of the SEQ ID NO:Y. 

12. The isolated polypeptide of claim 1 1 , wherein the secreted form or the full 
length protein comprises sequential amino acid deletions from either the C-terminus or the 
N-terminus. 



13. An isolated antibody that binds specifically to the isolated polypeptide of 
claim 11. 



14. A recombinant host cell that expresses the isolated polypeptide of claim 1 1 . 

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that said 
polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount of the 
polypeptide of claim 1 1 or the polynucleotide of claim 1. 

1 8. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1 ; and 
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(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of claim 
1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 1 1 
comprising: 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the polypeptide. 

21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 
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FIG. 1A 



1 gcgtccgggccaaaatgctgagaacgtccactcctaatctgtgtggtggtctgcattgcc 6 0 
6 1 c^gccccct<xk;tctcttctggcattctctgccot 120 

121 TGGGCTTGCTGC AGGGACACCCCCAGTGCCTGGATTACGGGCCCCCTTTCCAGCCCCCTC 180 
181 TGCACCTTGAGTTTTGCTCTGACTATGAGTCCTTCGGCTGCTGTGATCAGCACAAGGACC 240 
241 GCCGCATCGCTGCCCGGTACTGGGACATCATGGAATATTTTGATCTGAAGAGACATGAGC 300 
301 TGTGTGGAGATTACATTAAAGAC ATCCTTTGCCAGGAGTGCTCGCCCTACGC AGCCCACT 3 60 

361 CTACGACGCCGAAAACACCCAGACGCCTCTCCGGAATCTCCCGGGCCTCTGCTCTGATTA 420 

42 1 CTGCTCTGCCTTCCATTCTAACTGTCACTCAGCCATTTCCCTGCTGACCAATGACCGCGG 480 

481 CCTCCAGGAGTCTCATGGAAGGGACGGTACCCGCTTCTGCCACCTCCTGGACCTTCCTGA 540 

541 C AAGGACT AT TGC TTC C C T AATGTC C TG AGGAACG ACT AT CTC AAC C GCC AC C TGGG C AT 600 

1 M 1 

601 GGTGGCCCAAGATCCTCAGGGCTGCCTGCAGCTCTGCCTGAGCGAGGTGGCCAACGGGCT 660 

2 VAQDPQGCLQLCLSEVANGL 21 

661 GAGGAACCCCGTCTCCATGGTCC ATGCTGGGGACGGCACCCATCGCTTCTTTGTTGCCGA 720 

22 RNPVSMVHAGDGTHRFFVAE 41 

721 GCAGGTAGGAGTGGTG'PGGGTCTACCTCCCTGATGGGAGTCGCCTGGAGCAACCCTTCCT 780 

42 QVGVVWVYLPDGSRLEQPFL 61 

781 GGACCTCAAGAACATCGTGTTGACCACCCCATGGATCGGGGATGAGAGAGGCTTCTTGGG 840 

62 DLKNIVLTTPWIGDERGFLG 81 

841 GTTGGCTTTTCACCCCAAATTCCGCCACAATCGCAAGTTCTATATTTATTATTCGTGCCT 900 

82 LAFHPKFRHNRKFYIYYSCL 101 

901 GGACAAGAAGAAGGTAGAAAAGATCCGAATTAGTGAGATGAAGGTTTCTCGGGCTGATCC 960 

102 DKKKVEKIRISEMKVSRADP 121 

961 T AAC AAAGCTGAC CTG AAATC AG AG AGGGT CAT CTTGG AG ATTG AAG AACCAGC CTC AAA 1020 

122 NKADLKSERVILEIEEPASN 141 



1021 CCATAATGGCGGACAACTTCTTTTTGGCCTGGATGGCTATATGTACATATTCACTGGGGA 1080 

142 HNGGQLLFGLDGYMYIFTGD 161 

1081 CGGGGGACAGGCTGGAGATCCCTTTGGCCTGTTTGGAAATGCTCAGAACAAAAGTTCCCT 1140 

162 GGQAGDPFGLFGNAQNKSSL 181 
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1141 GCTGGGAAAAGTTTTAAGGATCGATGTGAACAGGGCAGGCTCACATGGCAAGCGGTACCG 1200 
182 LGKVLRIDVNRAGSHGKRYR 201 



1201 AGTCCCCTCGGACAATCC ATTTGTTTCTGAGCC AGGGGCC CACCCC GC C ATCTATGC CTA 1260 
202 VPSDNPFVSEPGAHPAIYAY 221 



1261 TGGGATCAGGAACATGTGGCGTTGTGCTGTGGACCGAGGGGACCCCATCACGCGCCAGGG 1320 

222 GIRNMWRCAVDRGDPITRQG 241 

1321 C CGAGGCCGGATATTC TGTGGGGACGTGGGC C AGAAC AGGTTTG AAGAGGTTGAC CTC AT 13 B 0 

242 RGRIFCGDVGQNRFEEVDLI 261 

1381 TTTGAAAGGTGGAAACTATGGCTGGAGAGCAAAGGAAGGGTTTGCATGTTATGACAAAAA 1440 

262 LKGGNYGWRAKEGFACYDKK 281 

1441 ACTTTGTCACAATGCCTCTTTGGATGATGTTCTGCCAATCTATGCTTATGGCCATGCAGT 1500 

282 LCHNAS LDD VL P IYAYGHAV 301 



1501 GGGGAAGTC AGTCACTGGAGGTTATGTCTATCGTGGTTGTGAATCCCC AAATCTCAATGG 1560 
302 GKSVTGGYVYRGCESPNLNG 321 



1561 CCTGTATATCTTTGGAGACTTCATGAGTGGTCGACn^ATGGCTTTGCAGGAAGATAGAAA 1620 
322 LYIFGDFMSGRLMALQEDRK 341 



1621 AAACAAGAAATGGAAGAAGCAGGATCTTTGCCTGGGCAGCACCACGTCCTGTGCCTTCCC 1680 
342 NKKWKKQDLC LGSTTSCAF P 361 



1681 AGGGCTGATCAGCACCCATAGCAAGTTCATCATCTCCTTTGCTGAAGATGAAGCAGGGGA 1740 
362 GLISTHSKFI ISFAEDEAGE 381 



1741 GCTGTATTTCCTGGCGACCTCTTAC C C AAGTGCCTATGC ACCACGTGG ATCTATTTAC AA 1800 
382 LYFLATSYPSAYAPRGSIYK 401 



1801 GTTTGTTGACCCCTCAAGGCGAGCACCCCCAGGCAAGTGCAAATACAAGCCAGTGCCCGT 1860 
402 FVDPSRRAPPGKCKYKPVPV 421 



1861 GAGAACCAAGAGTAAGCGGATCCCGTTCAGACCACTCGCCAAGACAGTCTTGGACTTGCT 1920 
422 RTKSKRIPFRPLAKTVLDLL 441 



1921 AAAGG AAC AATC AGAGAAAGCTGC T AG AAAATCTTC CAGTGC AAC C TT AGC TTC TGGC CC 1980 
442 KEQSEKAARKSSSATLASGP 461 



1981 AGCCC AGGGTTTGTCTGAGAAAGGCTCCTCCAAGAAGCTGGCTTCTCCTACAAGCAGCAA 2040 
462 AQGLSEKGSSKKLASPTSSK 481 



2041 GAATACATTGCGAGGGCCTGGTACAAAGAAGAAAGCCAGAGTGGGGCCCCACGTCCGCCA 2100 
482 NTLRGPGTKKKARVGPHVRQ 501 
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2101 GGGC AAG AGG AGG AAG AGC C TG AAAAGC C AC AGTGGCAGGATGAGGCCATCAGC AGAGCA 2160 

502 GKRRK. SLKSHSGRMRPSAEQ 521 

2161 GAAGCG AGC TGGC AGAAGTCTCC CTTGACCTATTGGTC AAGGTGGCCGAC AGGGTGACGT 2220 

522 KRAGRSLP* 530 

2221 GAGAGAGGAGAGCCACCTCATCAAATGAAAGTC ACTGCTGAATAAAGACCTTAGAAGTCT 2280 

2281 GGGAAGCCAGGGTAGAGGTGGGGCAGGGCGGTTTTCCTCTCCCTGGGAAATCTTGCTGTC 2340 

2341 TACTGAATAAATAAATGCACCTTCTCTGTATGCAGTGCTTCTGTGGGAGACCATATCCCA 2400 

2401 GATTGCTGGTGCACCTGGGTTATGGTAAGCACTATCCATGAGC CTGCTTGGAATCACACT 2460 

2461 GGATGTCTCCGTTTTGTCTTGTAAATGCCTACAACCTGAGGTAATAAATCAACATTTGCT 2520 

2521 CAAACTGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2580 

2581 AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2609 
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30 



40 
— J- 



HACRB64 aa 

jPKPTPNSERVSVRFPGCRTGMHMISVSLRLVFCSFIFKA gi|l234787 
]LKM LSF KLLLLAVAL6F FEG gb|AAD3 1172.1 



— r- 

50 



— I— 

60 



— i— 

70 
J- 



80 



HfiGRB64.aa 

GVLLG HPQCLDYGPP FKPLVHLEF gi|l234787 

DAKFGERSEGSGARRRRCLNGNPPKRLKRRDRRVMSQLEL AAD31172.1 



90 



— 

100 
I 



110 



120 

-a. 



HACfiB64.aa 

CSE-YETFGCCDQDRDNVIAEKYWSIMDYFDLN gi|l234787 

LSGGEILCGGFYPRVSCCLQSDSPGLG RLENKIFSAT gb|AAD81172.1 



130 
U- 



140 



— ,— 

150 
l_ 



T 

160 

L. 



_ — _ _ — — HAQAB64 aa 

NYHICGGYIKDILCQECSPYAAHLYDAEDPHTPLR VI gi|l234787 

NNSECSRLLEEIQCAPCSPHSQSLFYTPE-RDVLDGDLAL #|AHJ31172.1 



170 



— I — 

180 



190 



— r 
200 
I. 



HftCRB64.aa 



PGLCFNYCSEFHLKCQNSITLLTEDKQIRESCDKGRDLFC gi|l234787 
PLLCKDYCKEFFYTCRGHI PGLLQTTA — -DEFC tf>|AAD31172.1 



2 
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169 



210 
t_ 



220 
— I— 



230 
l_ 



— r 

240 



HAQAB64.aa 

LHNTELNNNLGSB gi|l234787 
FYYARKDAGLCFPDFPRKQVRGPASNYLGQMEDYEKVGGI gb|AAD31172.1 



SLLNLPDEDYCFPNV- 



HftORB64.aa 

"BBS gi (1234787 
SEQlBilQ gb|AAD31172.1 



HACftB64.aa 
LJB gi 1 1234787 
Lgs gb|AAD31172.1 




_ I HKftB64.aa 

1 Y Q N HNHI V 4P I M0 E Y R "Ffffljin^nfl gi|l234787 

]N Y K K|J|G|3lQvsHT TNQERWAIGPH DH I LBvvgY t| gb|AH>31172.1 



HX£B64.aa 
LR gi 1 1234787 
FL0 gb|AAD31172.1 



jjHACftB64.aa 

DGRRAN gi|l234787 
LEDdTDMCN #>|AAIB1172.1 
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HACAB64.aa 
rBOlvD3i|3ke gi| 1234787 

gb|AAD3U72.1 



HftCftB64.aa 



634 ISBB E 1 V NQS V K PT|JHPHS KTp TKF PTKV PPTPTKFPTKV gi|1234787 
597 TVBPAQP gb|AflE01172.1 



— r~ 

730 
I 



I 

740 
— L_ 



I 

750 

L_ 



T 

760 



460 - 



«r wm _ ! HfiCftB64.aa 
674 PPTHtHFPTKVPPIPTKVPSKVPPTPTQFPTKV PPTPtDv gi|l234787 
604 gfe|AHD31172.1 



I HMfiB64.aa 



714 QTQVQS TPTIAHTK vWlnHrH LPSKABsHQTMVPTK vQp T gi|1234787 
604 ~ - LTSDCgR L C R NgY Y Q P T G K C C C S P G W gb|AAD31172.1 



810 



-r 



830 



501 BE H" ~ " " HftCRB64.aa 

754 PTBL PTKVPPITTKVSNKVLLTSPELPTKVPPTPTKLPTN gi|1234787 
630 Eg D F - - _ _ _ _ gb|AAD3H72.1 



850 



860 



870 



504 



HftQfiB64.aa 



794 APPTSILLSPTPIKLPTKISLTLTSVPIKNQLTSAKLLTT gi|l234787 



634 



gb|AAEB1172.1 
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890 



504 



900 
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910 
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— r 

920 



~ HRCRB64.aa 

834 TLPISTKRATKLPSTSTSVPSNTSCILTHVQPKMLPTETR gi|l234787 
634 - - CRIAKCEPACRHGGVCV 



— I — 

930 



940 



950 



504 

874 VPNKMPPKPTR I P T M S M Y I T K K P PfiB K NgA K K V T D K 

651 - -RpnkclckkgylgpqceqvdrnvGrvt 



gfe|AAD31172.1 



960 



HAQAB64.aa 
TK gi |1234787 
- - gb|AAC31172.1 



970 
l— 



980 
L- 



990 



520 

914 spDttkppkppksktsvvnqpkkketktgvnn k tTIt 
678 ragildq iidmtsy 




1010 



530 
954 
693 



I 

1020 
J 



— I — 

1030 



1000 

HftCAB64.aa 
PK gi |1234787 
- - gb|AAD31172.1 



1040 

L. 



HAfflB64.aa 

954 AKEPKKEKKTIKVKQPVSHYFPPQKPKKQKIKKMQKEGNE gi|1234787 
«» LDLTSYIV gb|AAD3H72.1 



994 KS HACfiB64.aa 
KS gi|1234787 
700 gb|AAD31172.1 

Deooratian 'Decoration #1': Shade (with solid hlack) residues that natch HfiCftB64.aa exactly. 
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1 CACCAGCACCCCGCCCAGAGCAGTGCCGCTGCCCAAATCCTCGCAGGCAGCTCATCAACG 6 0 

61 CAATTGCAACTCCGGCTGGAGCCCCGGACCTGCAAGCCTGGGTGTCCGTGGGTCCGTCTG 120 

121 CCCAGCCATCTGCTGGTGGCACCTCTCCCTCCTGCCGCCTCCCTCGGTGAACCCCACCTT 180 

181 G C AGAAGTGC AGC TCGCCCGGAGCAGC CC AGGAGCTC AGC ATGCGTCCC CCAGGCTTC AG 240 

1 M R P P G F R 7 

241 GAACTTCTTGCTGCTGGCGTCCTCCCTTCTCTTTGCTGGGTTGTCAGCTGTTCCTCAAAG 300 

8 NFL LLASSLLFAGLSA V P Q S 27 

301 CTTCTCGCCATCTCTGAGGAGCTGGCCGGGCGCCGCCTGCAGGCTGTCCCGGGCCGAGTC 360 

28 FSPSLRSWPGAACRLSRAES 47 

3 4 8 ^^R^R^^R^ 0 ^^ 420 



421 c ^CTCCGACTGCGGCGTCTGCATCTGCCACGTGACTC^GCCOGGCATGTTCTTCGGGCC 480 



68 



87 



481 CCO^TGAGTGCCATGAGTGGGTGTGCG 540 
88 WM E C H E W V C 



107 



541 rcgT^GTGTGA^ 600 



108 



127 



601 Q ™££ Q ^ CT ^^ 660 
128 ^ m T N C n T. T K K K S N Q jMMM 147 



661 A ^£ftS?P^ TC ^ 720 



148 



167 



168 A «M« 780 



781 Sfiflfffi^ 840 



188 



207 



208 900 
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FIG. 4B 



228 i«li«^ IiB^^ 247 



961 ^m^^ 1020 



248 



G D I H G D 267 



1021 CACCTGTGAATGTGATGAGAGGGACTGTAGAGCTGTCTATGACCGATATTCTGATGACTT 1080 
268 TCECD ERDCRAVY l ^^^^^g^^^g 287 



1081 C^TTCAGGTCATGGACAGTGTAATTGCGGAAGATGTGACTGCAAAGCAGGCTGGTATGG 
288 



1140 
307 



1 308 G |im^^ 1200 



1201 Cfiffi^^ 12 60 

mmmmmmmmmmmmmmmmmx^mmmmm ^ 347 



1261 CTGCTATCCT^ 



348 



TCECDDRR 



1320 
367 



1321 CTGTGAAGACCTCGATGGTGTG OTCTGTGGAG^ 
368 



1380 
387 



1381 TCTI^ 



1440 
407 



1441 GGAAGAACAAAGCAAGAATCTGTGTGAATCAGCA^TGGCATATTC 
408 EEQSKNLCE 



1500 
427 



1501 TTCTTGTCATTGTGGGAAGTGCA TTTGTTCTGCTGAAGAGTGGTAT^ 
428 " 



1560 
447 



1561 CTGTGACTGTGATGACAGAGACTGCGACAA^CATGATG^TCTCATTTGTACAGGGAATGG 
448 i D C D D R D 



1620 
467 



1621 ^ TA ^ T AGCTGTGGAAACTGTGAATGCTGGGATG^ATGGAATGGAAATG^ 

468 mmm '"" 



1680 
487 



1681 CTGGCTTGGCTCAGAATATCCTTAACAATTACATGAGAGAGGTCTGGATTCTTATTTTTT 1740 
4 88 W L G g r v P * 495 
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1741 CTGGGCCATTAGAACATATAAATGCGAAGGAAACCATGTATATTCACCACTAGGACAGGT 1800 

1801 TAAAAAGACCATTGTATGTTTTTCTATTTCTGAATTACGAATGAAATCCGAGTACCTATT 1860 

1861 AGAAATGAGTTATGCAAATTTAGATGCAAATAACATTAGAAAAAAAAGATTCTTCCATAA 1920 

1921 TTAACATAAGTGGTTCCTAACGAGAGCAATTTTTCCACCCAAAAGTCATTTGGCAACATC 1980 

1981 TACAGACAATTTTGATTGTCACACTK^GTCGGGTAGGAAGGTATGCTGCAGACATTTGGT 2040 

2041 GGGTAGAGGCCAGGGATGCTGCTGAGC ATCCCGCAGTGTACAGGACAGCCCCCAAACAAG 2100 

2101 GAATTATCCAGCCCC AAATGCCAATAGGGCTCAGACTGAGAAACATTGAGTTATATGGCT 2160 

2161 ATTAGAAATCCACATTCTTACACAAGAAAGACCATATTAGAATCTAAGGAAAACATGCAT 2220 

2221 ATTCACATTAATTAATCGATCAGATTTTTCCAGAATTCCGTATCAGTCACCATTTTAA 2280 

2281 TGGGG AC AATGAAGA C AAGC AC AC AGGAGGT AG AAT ATC AG AGT GG GGCTG GATC AAGGG 2340 

2341 CAAAAACTGGTCATTAAGTCATCTGACATTAAATCATTTAGCCACTAAGTTATTTGTGTA 2400 

2401 CTCTCACTTTAAACT(^CCAAAGAAGATTCTCTTAAAGAAATTATGAAAAATGTACAATT 2460 

2461 TAACATTTTAAATAAATAGTGACAGAAGTTGTTTAAAAA 2499 
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Decoration 'Decoration #1': Shade (with solid black) residues that natch the Consensus 
exactly. 
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FIG. 7A 

1 GGCACGAGAGCGTGACCCAGCTGCGGCCGGCCAGCCATGGAGACTGGAGCGCTGCGGCGC 6 0 

1 METGALRR 8 

6 1 CCGCAACTTCTCCCGTTGCTGCTGCTGCTCTGCGGTGGGTGTCCCAGAGC AGGCGGCTGC 120 

9 PQL LPLLLLLCGGCPRAGG C 28 

121 AACGAGACAGGCATGTTGGAGAGGCTGCCCCTGTGTGGGAAGGCTTTCGCAGACATGATG 180 

29NETGMLERLPLCGKAFADMM 48 

181 GGCAAGGTGGACGTCTGGAAGTGGTGCAACCTGTCCGAGTTC ATCGTGTACTATGAGAGT 240 

49GKVDVWKWCNLSEFIVYYES 68 

241 TTCACCAACTGCACCGAGATGGAGGCCAATGTCGTGGGCTGCTACTGGCCCAACCCCCTG 300 

69FTNCTEMEANVVGCYWPNPL 88 

301 GCCCAGGGCTTCATCACCGGCATCC ACAGGCAGTTCTTCTCCAACTGCACCGTGGACAGG 360 

89AQGFITGIHRQFFSNCTVDR 108 

361 GTCCACTTGGAGGACCCCCCAGACGAGGTTCTCATCCCGCTGATCGTTATACCCGTCGTT 420 

109 VHLEDPPDEVLI P L I V I P V v 128 

421 CTGACTGTCGCCATGGCTGGCCTGGTGGTGTGGCGCAGCAAACGCACCGACACGCTGCTG 480 

129 LTVAMA GLVV WRSKRTDTLL 148 

481 TGAGGGTCCCGGTGAGATGGAGTGGGTCACACCTGGCAAGCTGGAAGAAAGTTCCCTGGG 540 

149 * 149 

541 GATGGGAGAGCGGGTGGGTGCTGCCAATCTCCAGCTACTGTGGCCACACCCCACCTGGTC 600 

601 ATGGGCAGACCCCTCCCTTCCTGGGCTGACCTGCTCCCTCGAGGCCAGCCTGCTCCCTGG 660 

661 CTGAGGCTCAGGCTATCCGCCCAAGCTCTTTGCTCATTCTAGGGCCAGTGGAGGAAAATG 720 

721 TGATAAGGCCAGAGCTTGTGTGCTGGGCACAGAAATCACCTGCTGCATCCTGTGCTCCGC 780 

781 AGCTGGGCCGGACCTCTGCCCGC AGGTTTCTATGCTGTTTCTTAGCACAGAATCCAGCCT 840 

841 AGC CTTAGCCGCAGTCTAAGC CCTGCTTGGACTAGGACTC CTTGCTTGACCCCATCTCTG 900 

901 GTTCCTGCCCTGGCTC CTGC ACCAGC CCCAGCTCCTGCCT ACATCCAGGCAG AAAGATAG 960 
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961 CAGGGGCTCTTGGAAGACGTTCCGTGCTGTGACCTCCGAGCCCTCCTGGTGGGAAGACAG 1020 

1021 CTGGAAAGGCTGGGAGGAGAAGGGAGGGGTTGGGGGTTCCCAGGAGCCATGCGTGGCCTG 1080 

1081 C AGAG TC C ATTC CAT CATG ATG CTGTGC CCGC T ATG GGC TG TGTC C ATG AC CAGAG GC TG 1140 

1141 GAGT<KX^GTGTGTTAGAGCCCCTCACCGGGACTTGCTGTGCGGATGGGGCCTGGGCCTC 1200 

1201 CTTCCTACAGGGGCTCC TCTGTGGGTGAGGGGCCCTCTGGAATGGC ATCC C ATGAGCTTG 1260 

1261 TGGCCTCTATCTGCTACCATCTGTGTT^ 1320 
1321 AAAAAAAAAAAAAAAAAAA 1339 
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Decoration ' Decora tion #1': Shade (with solid black) residues that natch the Consensus 
exactly. 
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FIG. 10A 

1 GGCACGAGGACAGCCTCCACCAGAGTCCCCACCTTTCTGGAAGCTGCAGGGCTCTCCATC 60 

61 CAGGATCCAGAAGCATTGAAGGGGACCAGCCGCTGAAGGGATTCTCAGTCCCATCTGACT 120 

121 CCCCATGAGGCTCCTGGCTTTCCTGAGTCTGCTGGCCTTGGTGCTGCAGGAGACAGGGAC 180 
1 MRLLAFLSLLALVLQETGT 19 

181 AGCTTCTCTCCCAAGGAAGGAGAGGAAGAGGAGAGAGGAGCAGATGCCCAGGGAAGGCGA 240 
20 A S LPRKERKRREEQMPREGD 39 

241 TTCCTTTGAAGTTCTGCCTCTGCGGAATGATGTCCTGAACCCAGACAACTATGGTGAAGT 300 

40 SFEVLPLRNDVLNPDNYGEV 59 

301 C ATTG AC C TG AGC AAC T ATG AGG AGC TC AC AG ATT ATGGGGA C C AAC TC CCC G AGGTT AA 360 

6 0 I DLSNYEELTDYGDQLPEVK 79 

361 GGTGACTAGCCTCGCTCCTGCAACC AGC ATCAGTCCCGCCAAGAGCACTACGGCTCCAGG 420 

80 VTSLAPATS I SPAKSTTAPG 99 

421 GACACCCTCGTCAAACCCCACGATGACCAGACCTACTACAGCAGGGCTGCTACTGAGTTC 480 

100 TPSSNPTMTRPTTAGLLLS S 119 

481 CCAGCCCAACCATGGTCTGCCCACCTGCCTGGTCTGCGTGTGCCTCGGTTCCTCTGTGTA 540 

120 QPNHGLPTCLVCVCLGSSVY 139 

541 TTGCGATGACATTGACCTAGAGGACATTCCTCCTCTTCCTCGG^GGACTGCCTACCTGTA 600 

140 CDDIDLEDI PPLPRRTAYLY 159 

601 TGCACGCTTCAACCGC ATC AGCC GTATC AGGGC CGAAGACTTC AAAGGGCTG AC AAAGTT 660 

160 ARFNRI SRIRAEDFKGLTKL 179 

661 GAAGAGGATTGACCTCTCCAACAACCTCATTTCCTCCATCGATAATGATGCCTTCCGCCT 720 

180 KRIDLSNNLISSIDNDAFRL 199 

721 GCTACATGCCCTCCAGGACCTCATCCTCCCAGAGAACCAGTTGGAAGCTCTGCCCGTGCT 780 

200 LHALQDLI LPENQLEALPVL 219 

781 GCCCAGTGGC^TTGAGTTCCT<^ATGT€CGCCTAAATCGGCTCCAGAGCTCGGGGATACA 840 

220 PSGIEFLDVRLNRLQSSGIQ 239 

841 GCCTGCAGCCTTCAGGGC AATGGAGAAGCTGCAGTTCCTTTACCTGTCAGACAACCTGCT 900 

240 PAAFRAMEKLQFLYLSDNLL 259 
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901 GGATTCTATCCCGGGGCCTTTGCCCCCGAGCCTGCGCTCTGTACACCTGCAGAATAACCT 960 

260 DSIPGPLPPSLRSV HLQNNL 279 

961 GATAGAGACCATGCAGAGAGACGTCTTCTGTGACCCCGAGGAGCACAAACACACCCGCAG 1020 

280 IETMQRDVFCDPEEHKHTRR 299 

1021 GCAGCTGGAAGACATCCGCCTGGATGGCAACCCCATCAACCTCAGCCTCTTCCCCAGCGC 1080 

300 Q LEDIRLDGNPINLSLFPSA 319 

1081 CTACTTCTGCCTGCCTCGGCTCCCCATCGGCCGCTTCACGTAGCTCGGAGCCCTTCCACT 1140 

320 YFCLPRLPIGRF T* 333 

1141 CCTCCCAGGTCATCTCTTGGACCAGCGGGCATCACATTCTCCAGCAGCCGCCATCTCACA 1200 

1201 CGCCTCCCTCCTGTGGCCGCCGGCAGCATGGACAAAGGTCTCCATGCAGGGGGAGGAGGC 1260 

1261 CTGCTTCTTTCCCCACAGCTCTCACGTCTCCCTTCTCCCTGCGGGTGACAAAGAAGCCCA 1320 

1321 AGGACCACCTCCTTCCTGCCTCATTGTAATAAAATTCCCCACACTGAAAAAAAAAAAAAA 1380 

1381 AAAAAAAAA 1389 
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Decoration 'Decoration #1' : Shade (with solid black) residues that natch the Consensus 
exactly. 
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FIG. 13A 



1 CCACGCGTCCGAGAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATG 6 0 

1 Ml 

6 1 CTACTGCCACTGCTGCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTC 120 

2 LLP LLLSSLLGGSQA M D G R F 21 

121 TGGATACGAGTGC AGGAGTC AGTGATGGTGCCGGAGGC CTGTGAC ATCTCTGTGCC CTGC 180 

22WIRVQESVMVPEACDI SVPC 41 

181 TCTTTCTCCTACCCCCGACAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTC 240 

42SFSYPRQDWTGSTPAYGYWF 61 

241 AAAGCAGTGACTGAGACAACCAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAG 300 

62KAVTETTKGAPVATNHQSRE 81 

3 01 GTGGAAATGAGCACCCGGGGCCGATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGC 360 

82VEMSTRGRFQLTGDPAKGNC 101 

361 TCCTTGGTGATCAGAGACGCGCAGATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAG 420 

102SLVIRDAQMQDESQYFFRVE 121 

421 AGAGGAAGCTATGTGAGATATAATTTCATGAACGATGGGTTCTTTCTAAAAGTAACAGTG 480 

122RGSYVRYNFMNDGFFLKVTV 141 

481 CTCAGCTTCACGC CCAGACCCC AGGACC AC AACAC CGACCTC ACCTGCC ATGTGGACTTC 540 

142 L SFTPRP QDHNTDLTCHVDF 161 

541 TCCAGAAAGGGTGTGAGCGCACAGAGGACCGTCCGACTCCGTGTGGCCTATGCCCCCAGA 600 

162SRKGVSAQRTVRLRVAYAPR 181 

601 GACCTTGTTATCAGC ATTTC ACGTGAC AAC ACGCC AGCCC TGGAGCCCCAGCCC C AGGGA 660 

182 DLVISI SRDNTPALEPQPQG 201 

661 AATGTCCCATACCTGGAAGCCCAAAAAGGCCAGTTCCTGCGGCTCCTCTGTGCTGCTGAC 720 

202 NVPYLEAQKGQFLRLLCAAD 221 

721 AGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAGAACAGAGTCCTCTCCTCGTCCCAT 780 

222 SQPPATLSWVLQNRVLSSSH 241 

781 CCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGGGGTGAAGGCTGGGGATTCAGGG 840 

242 PWGPRPLGLELPGVKAGDSG 261 
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FIG. 13B 

841 CGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGCAGCGAGCCCTGGACCTCTCT 900 

262 RYTCRAENRLGSQQRALDLS 281 

901 GTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAAGCAAACAGGACAGTCCTG 960 

282 VQYPPENLRVMVSQANRTVL 301 

961 GAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCAAAGCCTCTGCCTGGTC 1020 

302 ENLGNGTSLPVLEGQSLCLV 321 

1021 TGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGAGGGGACAGGTTCTG 1080 

322 CVTHSSPPARLSWTQRGQVL 341 

1081 AGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTTCAAGTGGAGCAC 1140 

342 SPSQPSDPGVLELPRVQVEH 361 

1141 GAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGTCTCTCTCAGC 1200 

362 EGEFTCHARHPLGSQHVSLS 381 

1201 CTCTCCGTGCACTACTCCCCGAAGCTGCTGGGCCCCTCCTGCTCCTGGGAGGCTG^ 1260 

382 LSVHYSPKLLGPSCSWEAEG 401 

1261 CTGCACTGCAGCTGCTCCTCCCAGGCCAGCCCGGCCCCCTCTCTGCGCT^TGGCTTGGG 1320 

402 LHCSCSSQASPAPSLRWWLG 421 

1321 GAGGAGCTGCTGGAGGGGAACAGCAGCCAGGACTCCTTCGAGGTCACCCCCAGCTCAGCC 1380 

422 EELLEGNSSQDSFEVTPSSA 441 

13 81 GGGCCCTGGGCCAACAGCTCCCTGAGCCTCCATGGAGGGCTCAGCTCCGGCCTCAGGCTC 1440 

442 GPWANS S LS LHGGL S SGLRL 461 

1441 CGCTGTGAGGCCTGGAACGTCCATGGGGCCCAGAGTGGATCCATCCTGCAGCTGCCAGAT 1500 

462 RCEAWNVHGAQSGSILQLPD 481 

1501 AAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAGCGTTTCTGGGAATCGGCATCACG 1560 
482 KKGLlSTAFSNGAF*~-~-- 



501 



1561 GCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAGATTCTACCGAAGAGACGGACT 1620 

502 ALLFLrT 'ALIT MKILPKRRT 521 

1621 CAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCACGATCCTGGATTAC ATCAAT 1680 

522 QTETPRPRFSRHSTILDYIN 541 
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1681 GTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGAAAGCCACACCAAACAGT 1740 

542 VVPTAGPLAQKRNQKATPNS 561 

1741 CCTCGGACCCCTCTTCCACCAGGTGCTCCCTCCCCAGAATCAAAGAAGAACCAGAAAAAG 1800 

562 PRTPLPPGAPSPESKKNQKK 581 

1801 CAGTATC AGTTGCCC AGTTTCCC AGAACC C AAATCATCC ACTC AAGCC CCAGAATCCC AG 1860 

582 QYQLPSFPEPKSST.QAPESQ 601 

1861 GAGAGCCAAGAGGAGCTCC ATTATGCC ACGCTC AACTTC CC AGGCGTC AGAC C C AGGC CT 1920 

602 ESQEELHYATLNFPGVR PRP 621 

1921 GAGGC CCGGATGCCC AAGGGCAC CCAGGCGGATT ATGC AGAAGTCAAGTTCC AATGAGGG 1980 

622 EARMPKGTQADYAEVKFQ* 640 

1981 TCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGAGGAAGGTAGAGTAGAAGTTGAAGAT 2040 

2041 AACCGAGTGCAAGAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCT 2100 

2101 TTCTCTCTCTTTTTAAAAAAACTCTGGCCACGGCACAGT(^CTCACGCCTGTAATCCCAG 2160 

2161 CACTTTGGGAAGTTGAAGTTGGGCCAAATCCCCTGAGTCCGGATTCGAA^CAGCCTGGC 2220 

2221 C AAC TTGTG AAAAC C CGTC TCT AC TT AAAAAT AC C C AAATT ACC TGGG C ATTG TGG CAGG 2280 

2281 GGCCTGTTTATCCTT 2295 
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841 GAGGGCCAGTCTCTGCGCTTGGTCTGTGCTGTTGACAGCAATCCCCCTGCCAGGCTGAG 900 

268 EGQSLRLVCAVDSNPPARLS 287 

901 TGGACCIXK^GGAGTCTGACCCTGTACCCCTCACAGCCCTCAAACCCTCTGGTACTGGAG 960 

288 WTWRSLTLYPSQPSNPLVLE 307 

961 CTGCAAGTGCACCTGGGGGATGAAGGGGAATTCACCTGTCGAGCTCAGAACTCTCTGGGT 1020 

308 LQVHLGDEGEFTCRAQNSLG 327 

1021 TCCCAGCACGTTTCCCTGAACCTCTCCCTGCAACAGGAGTACACAGGCAAAATGAGGCCT 1080 

328 SQHVSLNLSLQQEYTGKMRP 347 

1081 GTATCAGGAGTGTTGCTGGGGGCGGTCGGGGGAGCTGGAGCCACAGCCCTGGTCTTCCTC 1140 

348 VSGVLIjGAVGGA G A T A t, y r t. 367 

1141 TCCTTCTGTGTCATCTTCATTGTAGTGAGGTCCTGCAGGAAGAAATCGGC^ 1200 

368 SFCVTFT\7\7 RSCRKKSARPA 387 

1201 GCGGACGTGGGAGACATAGGCATGAAGGATGCAAAC ACCATCAGGGGCTCAGCCTCTCAG 1260 

388 ADVGDI GMKDANTIRGSASQ 407 

1261 GGTAACCTGACTGAGTCCTGGGCAGATGATAACCCCCGACACCATGGCCTGGCTGCCCAC 1320 

408 GNLTESWADDNPRHHGLAAH 427 

1321 TCCTCAGGGGAGGAAAGAGAGATC C AGTATGCAC CCCTCAGCTTTC ATAAGGGGGAGCCT 13 80 

428 SSGEEREI QYAPLSFHKGEP 447 

1381 CAGGACCTATCAGGTCAAGAAGCCACCAACAATGAGTACTCAGAGATCAAGATCCCCAAG 1440 

448 QDLSGQEATNNEYSEIKIPK 467 

1441 TAAGAAAATGCAGAGGCTCGGGCTTGTTTGAGGGTTCACGACCCCTCCAGCAAACK5AGTC 1500 

468 * 468 

1501 TGAGGCTGATTCCAGTAGAATTAGCAGCCCTCAATGCTGTGCAACAAGACATCAGAACTT 1560 

1561 ATTCCTCTTGTCTAACTGAAAATGCATGCCTGATGACCAAACTCTCCCm 1620 

1621 ATCGGTCCACACTCCCCGCCCTGGCCTCTGGTACCCACCATTCTCCTCTGTACTTCTCTA 1680 

1681 AGGATGACTACTTTAGATTCCGAATATAGTGAGATTGTAACGTGAAAAAAAAAAAAAAAA 1740 
1741 AAAAAAAA 1748 
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Decoration 'Decoration #1' : Shade (with solid black) residues that natch the Consensus 
exactly. 
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GCCGCGCCGAGGAGGCTGCCGCTCTGGCTTGCCGCCCCCCGCCGCCGCTGCACACCGGAC 



60 



61 CCAGCCGCCGTGCCGCGGGCCATGGACCTGCCCAGGGGCCTGGTGGTGGCCTGGGCGCTC 120 



1 



MDLPRGLVVAWAL 13 



121 AGCCTGTGGCCAGGGTTCACGGAC AC CTTC AAC ATGGAC ACC AGGAAGC CCCGGGTC ATC 180 

14 SLWPGFTDT FNMDTRKPRVI 33 

181 C CTGGCTCCAGGACCGCCTTCTTTGGCT AC AC AGTGC AGC AGCAC GACATC AGTGGC AAT 240 

34PGSRTAFFGYTVQQHDISGN 53 

241 AAGTGGCTGGTCGTGGGCGCCC C ACTGGAAAC C AATGGCTAC C AG AAG ACGGGAGACGTG 300 

54KWLVVGAPLETNGYQKTODV 73 

301 TACAAGTGTCCAGTGATCCACGGGAACTGCACCAAACTCAACCTGGGAAGGGTCACCCTG 360 

74YKCPVIHGNCTKLNLGRVTL 93 

361 TCCAACGTGTCCGAGCGGAAAGACAACATGCGCCTCGGCCTTAGTCTCGCCACCAACCCC 420 

94SMVSERKDNMRLGLSLATNP 113 

421 AAGGACAACAGCTTCCTGGCCTGCAGCCCCCTCTGGTCTCATGAGTGTGGGAGCTCCTAC 480 

114KDNSFLACSPLWSHECGSSY 133 

481 TACACCACAGGGATGTGTTCAAGAGTCAACTCCAACTTCAGGTTCTCCAAGACCGTGGCC 54 0 

134YTTGMCSRVNSNFRFSKTVA 153 

541 CCAGCTCTCCAAAGGTGCCAGACCTACATGGACATCGTCATTGTCCTGGATGGCTCCAAC 600 

154PALQRCQTYMDIVIVLDGSN 173 

601 AGC ATCTACCCCTGGGTGGAGGTTC AGC AC TTC CTCATC AACATCCTGAAAAAGTTTTAC 660 

174SIYPWVEVQHFLINILKKFY 193 

661 ATTGGCCCAGGGCAGATCCAGGTTGGAGTTGTGCAGTATGGCGAAGATGTGGTGCATGAG 720 

194IGPGQIQVGVVQYGEDVVHE 213 

721 TTTCACCTCAATGACTACAGGTCTGTAAAAGATGTGGTGGAAGCTGCCAGCCACATTGAG 7 8 0 

214FHLNDYRSVKDVVEAASHIE 233 

781 C AGAGAGG AGG AAC AGAG AC C C GGACGG C ATTTGGC ATTGAATTTGC AC GC TC AG AGGC T 840 

234 QRGGTETRTAFGI EFARSEA 253 
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841 TTCCAGAAGGGTGGAAGGAAAGGAGCCAAGAAGGTGATGATTGTCATCACAGATGGGGAG 900 
254 FQKGGRKGAKKVMIVI TDGE 273 

901 TCCC ACGAC AGCC CAGAC CTGGAGAAGGTGATC C AGC AAAGCGAAAG AGACAACGTAACA 960 
274 SHDSPDLEKVIQQSERDNVT 293 

961 AGATATGCGGTGGC CGTCC TGGGCT ACTAC AACCGCAGGGGG ATC AATCCAG AAACTTTT 1020 
294 RYAVAVLGYYNRRGINPETF 313 

1021 CTAAATGAAATCAAATACATCGCCAGTGACCCTGATGACAAGC ACTTCTTCAATGTCACT 1080 
314LNEIKYIASDPDDKHFFNVT 333 

1081 GATGAGGCTGCCTTGAAGGACATTGTCGATGCCCTGGGGGACAGAATCTTCAGCCTGGAA 1140 
334 DEAALKDIVDALGDRIFSLE 353 

1141 GGCACCAACAAGAACGAGACCTCCTTTGGGCTGGAGATGTCACAGACGGGCTTTTCCTCG 1200 

354 GTNKNETSFGLEMSQTGFSS 373 

1201 CACGTGGTGGAGGATGGGGTTCTGCTGGGAGCCGTCGGTGCCTATGACTGGAATGGAGCT 1260 

374 HVVEDGVLLGAVGAYDWNGA 393 

1261 GTGCTAAAGGAGACGAGTGCCGGGAAGGTCATTCCTCTCCGCGAGTCCTACCTGAAAGAG 1320 

394 VLKETSAGKVIPLRESYLKE 413 

1321 TTCCCCGAGGAGCTCAAGAACCATGGTGCATACCTGGGGTACACAGTCACATCGGTCGTG 1380 

414 F PEEL KNHGAYLGYTVTSVV 433 

13 81 TC CTC C AGGCAGGGGCGAGTGTACGTGGCCGGAGCCCCC CGGTTC AACCAC ACGGGC AAG 144 0 

434 SSRQGRVYVAGAPRFNHTGK 453 

1441 GTCATCCTGTTCACCATGCACAACAACCGGAGCCTCACCATCCACCAGGCTATGCGGGGC 1500 

454 VI LFTMHNNRSLTI HQAMRG 473 

1501 CAGCAGATAGGCTCTTACTTTGGGAGTGAAATCACCTCGGTGGACATCGACGGCGACGGC 1560 

474 QQIGSYFGSEITSVDIDGDG 493 

1561 GTGACTGATGTCCTGCTGGTGGGCGCACCCATGTACTTC AACGAGGGCCGTGAGCGAGGC 1620 

494 VTDVLLVGAPMYFNEGRERG 513 

1621 AAGGTGTACGTCTATGAGCTGAGACAGAACCGGTTTGTTTATAACGGAACGCTAAAGGAT 1680 

514KVYVYELRQNRFVYNGTLKD 533 
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1681 TC AC AC AGTTACC AG AATGCC CGATTTGGGTCCTCC ATTGC CTC AGTTCGAGACCTCAAC 1740 

534 SHS YQNARFGSSIASVRDLN 553 

1741 CAGGATTCCTACAATGACGTGGTGGTGGGAGCCCCCCTGGAGGACAACCACGCAGGAGCC 1800 

554 QDSYNDVVVGAPLEDNHAGA 573 

1801 ATCTAC^TCTTCCACGGCTTCCGAGGCAGCATCCTGAAGACACCTAAGCAGAGAATCACA 1860 

574 IYIFHGFRGS ILKTPKQRIT 593 

1861 GC CTC AGAGCTGGCT ACCGGCC TCC AGTATTTTGGCTGC AGC ATCC AC GGGCAATTGG AC 1920 

594 ASELATGLQYFGCSIHGQLD 613 

1921 CTCAATGAGGATGGGCTCATCGACCTGGCAGTGGGAGCCCTTGGCAACGCTGTGATTCTG 1980 

614 LNEDGL IDLAVGALGNAVI L 633 

1981 TGGTC CCGCCCAGTGGTTC AG ATC AATGC CAGCCTCC ACTTTGAGCC ATC C AAGATCAAC 2040 

634 WSRPVVQINASLHFEPSKIN 653 

2041 ATCTTCC ACAGAGACTGCAAGCGCAGTGGCAGGGATGCCACCTGCCTGGCCGCCTTCCTC 2100 

654 I FHRDCKRSGRD A T C I. - - - - 



673 



2101 TGCTTCACGCCCATCTTCCTGGCACCCCATTTCCAAACAACAACTGTTG^ATCAGATAC 2160 

674 CFTPIFLAP HFQTTTVGIRY 693 

2161 AACGCCACCATGGATGAGAGGCGGTATACACCGAGGGCCCACCTGGACGAGGGCGGGGAC 2220 

694 NATMDERRYTPRAHLDEGGD 713 

2221 CGATTCACCAACAGAGCCGTACTGCTCTCCTCCGGCCAGGAGCTCTGTGAGCGGATCAAC 2280 

714RFTNRAVLLSSGQELCERIN 733 

2281 TTCCATGTCCTGGACACTGCTGACTACGTGAAGCCAGTGACCTTCTCAGTCGAGTATTCC 2340 

734 FHVLDTADYVKPVTFSVEYS 753 

2341 CTGGAGGACCCTGACCATGGCCCCATGCTGGACGACGGCTGGCCCACCACTCTCAGAGTC 2400 

754 LEDPDHG PMLDDGWPTTLRV 773 

2401 TCGGTGCCCTTCTGGAACGGCTGCAATGAGGATGAGCACTGTGTCCCTGACCTTGTGTTG 2460 

774 SVPFWNGCNEDEHCVPDLVL 793 

2461 GATC^CCGGAGTGACCTGCCCACGGCCATGGAGTACTGCCAGAGGGTGC^GGAAGCCT 2520 

794 DARSDL PTAMEYCQRVLRK P 813 
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2521 GCGCAGGACTGCTCCGCATACACGCTGTCCTTCGACACCACAGTCTTC ATCATAGAGAGC 2580 
814 AQDCSAYTLSFDTTVFI IES 833 



2581 ACACGCCAGCGAGTGGCGGTGGAGGCCACACTGGAGAACAGGGGCGAGAACGCCTACAGC 2640 
834 TRQRVAVEATLENRGENAYS 853 



2641 ACGGTCCTAAATATCTCGCAGTCAGCAAACCTGCAGTTTGCCAGCTTGATCCAGAAGGAG 2700 

854 TVLNI SQSANLQFASLIQKE 873 

2701 GACTC AGACGGTAGCATTGAGTGTGTGAACGAGGAGAGGAGGCTCCAGAAGCAAGTCTGC 2760 

874 DSDGSIECVNEERRLQKQVC 893 



2761 AACGTCAGCTATCCCTTCTTCCGGGCCAAGGCCAAGGTGGCTTTCCGTCTTGATTTTGAG 2820 
894 NVSYPFFRAKAKVAFRLDFE 913 



2821 TTCAGCAAATCC ATCTTCCTACACCACCTGGAGATCGAGCTCGCTGCAGGCAGTGACAGT 2880 
914FSKSIFLHHLEIELAAGSDS 933 



2881 AATGAGCGGG AC AGC ACC AAGG AAGAC AACGTGG C C C C C TT ACGC TTCC AC CTC AAAT AC 2940 
934 NERDSTKEDNVAPLRFH LKY 953 



2941 GAGGGTGACGTCCTCTTCACCAGGAGCAGCAGCCTGAGCCACTACGAGGTCAAGCTCAAC 3000 

954 EADVLFTRS SSLSHYEVKLN 973 

3001 AGCTCGCTGGAGAGATACGATGGTATCGGGCCTCCCTTCAGCTGCATCTTCAGGATCCAG 3060 

974 SSLERYDGIGPPFSCIFRIQ 993 

3061 AACTTGGGCTTGTTCCCCATCCACGGGATTATGATGAAGATCACCATTCCCATCGCCACC 3120 

994 NLGLFPIHGIMMKITIPIAT 1013 



3121 AGGAGCGGCAACCGCCTACTGAAGCTGAGGGACTTCCTCACGGACGAGGTAGCGAACACG 3180 
1014 RSGNRLLK LRDFLTDEVANT 1033 



3181 TCCTGTAACATCTGGGGCAATAGCACTGAGTACCGGCCC ACCCCAGTGGAGGAAGACTTG 3240 
1034 SCNIWGNSTEYRPTPVEEDL 1053 



3241 CGTCGTGCTCCACAGCTGAATCAC AGCAACTCTGATGTCGTCTCCATCAACTGCAATATA 3300 
1054 RRAPQLNHSNSDVVS INCNI 1073 



3301 CGGCTGGTCCCCAACCAGGAAATCAATTTCCATCTACTGGGGAACCTGTGGTTGAGGTCC 3360 
1074 RLVPNQEINFHLLGNLWLRS 1093 
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3361 CTAAAAGCACTC AAGTACAAATCC ATGAAAATCATGGTCAACGCAGCCTTGCAGAGGC AG 3420 
1094 LKALK.YKSMK IMVNAALQRQ 1113 

3421 TTCCACAGCCCCTTCATCTTCCGTGAGGAGGATCCCAGCCGCCAGATCGTGTTTGAGATC 3480 
1114 FHSPFIFREEDPSRQIVFEI 1133 

3481 TCCAAGCAAGAGGACTGGCAGGTCCCCATCTGGATCATTGTAGGCAGC ACCCTGGGGGGC 3540 
1134 SKQEDWQVPIW IIVGSTT, RR 1153 

3541 CTCCTACTGCTGGCCCTGCTGGTCCTGGCACTGTGGAAGCTCGGCTTCTTTAGAAGTGCC 3600 

1154 LLLLALLV LALWKLGFFRSA 1173 

3601 AGGCGCAGGAGGGAGCCTGGTCTGGACCCCACCCCC AAAGTGCTGGAGTGAGGCTCCAGA 3660 

1174 RRRREPGLDPTPKVLE* H90 

3661 GGAGACTTTGAGTTGATGGGGGCC AGGAC AC CAGTCC AGGT AGTGTTGAG AC CC AGGCCT 3720 

3721 GTGGCCCCACCGAGCTGGAGCGGAGAGGAAGCCAGCTGGCTTTGCACTTGACCTCATCTC 3780 

3781 CCGAGCAATGGCGCCTGCTCCCTCCAGAATGGAACTCAAGCTGGTTTTAAGTGGAACTGC 3840 

3841 CCTACTGG^GACTGGGACACCTTTAACACAGACCCCTAGGGATTTAAAGGGACACCCCT 3900 

3901 ACACACACCCAGGCCC^CGCCAAGGCCTCCCTCAGGCTCTGTGGAGGGCATTTGCTGCCC 3960 

3961 CAGCT ACTAAGGTGC TAGG AATTCGTAATCATCCCC ATCCTCCAGAGAAAC CC AGGGAGG 4020 

4021 AAGACTGTAAATACGAACCC AATCTGC AC ACTCC AGGC CTC TAGTTCC AGAAGGATCC AA 4080 

4081 GACAAAACAGATCTGAATTCTGCCCTTTTCTCTCACCCATCCCACCCCTCCATTG^ 4140 

4141 CAAGTCAC AC CCACTCCCTTCCCCATAGATAGGCC CCTGGGGCTCCCGAAGAATGAAC CC 4200 

4201 AAGAGCAAGGGCTTGATGGTGACAGCTGCAAGCCAGGGATGAAGAAAGACTCTGAGATGT 42 60 

4261 GX5AGACTGATGGCCAGGCAAGTGGGACCAGGATACTGGACGCTGTCCTGAGATGAGAGGT 4320 

4321 AGCCGGGCTCTGCACCCACGTGCATTCACATTGACCGCAACTCACACATTCCCCCACCAG 4380 
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4381 CTGCAGCCCCTTGCTCTCAGCTGCCAACCCTCCCGGGTCACTTTTGTTCCCAGGTACCTC 4440 

4441 ATGGGAAGCATGTGGATGACACAATCCCTGGGGCTGTGCATTCCCACGTCTTCTTGCTGC 4500 

4501 AGCCTGCCCCTAGACATGGACGCACCGGCCTGGCTGCAGCTGGGCAGCAGGGGTAGGGGT 4560 

4561 AGGGAGCCTCCC CTCCCTGTATC ACCCCC TCCCTAC AC AC AC AC AC AC ACAC AC ACAC AC 4620 

4621 AC ACTGC CTC CC ATC C TTC C CTC ATGC C C GCC AG TGC AC AGGGAAGGG CTTGGC C AGC GC 4680 

4681 TGTTGAGGGGTCCCCTCTGGAATGCACTGAATAAAGCACGTGCAAGGACTCCCGGAGCCT 4740 

4741 GTGCAGCCTTGGTGGCAAATATCTCATCTGCCGGCCCCCAGGACAAGTGGTATGACCAGT 4800 

4801 GATAATGCCCCAAGGACAAGGGGCGTGCCTGGCGCCCAGTGGAGTAATTTATGCCTTAGT 4860 

4861 CTTGTTTTGAGGTAGAAATGCAAGGGGGACACATGAAAGGCATCAGTCCCCCTGTGCATA 4920 

4921 GT ACGACCTTTACTGTC GTATTTTTGAAAAATTAAAAATAC AGTGTTTAAAAACAAAAAA 4980 

4981 AAAAAAAAAAAAAAA 4995 
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Decoration 'Decoration #1* : Siade (with solid black) residues that natch the Consensus 
exactly. 
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1 GGCACGAGCTGTCATCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCT 60 



M A 2 



6 1 CTCATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTT 120 
3 LML SLVLSLLKLGSG Q W Q V F 22 

121 GGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGACGCAGCATTCTCCTGTTTCCTG 180 
23GPDKPVQALVGEDAAFSCFL 42 

181 TCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGGCCAGTTCTCTAGC 240 
43 SPKTNAEAMEVRFFRGQFSS 62 

241 GTGGTCCACCTCTAC AGGGACGGGAAGGAC CAGCC ATTTATGCAGATGC C AC AGTATC AA 300 
63VVHLYRDGKDQPFMQMPQYQ 82 

301 GGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTO 360 
83 GRTKLVKDSIAEGRI SLRLE 102 

361 AAC ATTAC TGTGTTGGATGC TGGC CTC T ATGGGTGC AGG ATT AGTTC C C AGTC TT ACT AC 420 

103 N I TVLDAGLYGCR I SSQ SYY 122 

421 CAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATC 480 

123QKAIWELQVSALGSVPLISI 142 

481 GCGGGATATGTTGATAGAGACATCCAGCTACTCTGTCAGTCCTCGGGCTt^TTCCCCC^ 540 

143AGYVDRDIQLLCQSSGWFPR 162 

541 CCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCCACAGACTCCAGGACAAAC 600 

163PTAK WKGPQGQDLSTDSRTN 182 

601 AGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCC^ 660 

183RDMHGLFD VEISLTVQE NAG 202 

661 agcatatcctgttccat^ggcatg^^ 720 

203 SI SCSMRHAHLSREVESRVQ 222 

721 ATAGGAGACTCGAGAAGAAAGCACGGACAGGCAGGTAAAAGAAAATATTCCTCTTCACAC 780 

223 IGDWRRKHGQAGKRKYSSSH 242 

781 ATTTATCACTCCTTTCCAAGTCOT 8 4 0 

243 IYDSFPSLSFMDFYILRPVG 262 
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841 CCCTGCAGAGCCAAGCTTGTGATGGGAACTCTGAAATTGCAGATTCTGGGGGAGGTGCAT 900 

263 PCRAKLVMGTLKLQILGEVH 282 

9 0 1 TTTGTAGAGAAGC CCC AT AGCCTTCTTC AG ATCTCTGGAGGGTCC AC AAC AC TCAAAAAG 960 

283 FVEKPHSLLQ I SGGSTTLKK 302 

9 6 1 GGTCCCAATCCTTGGTCTTTCCCTTCTCCCTGCGCCCTGTTTCCC ACGTGAGCACGGAAC 1020 

303 GPNPWSFPS PCALFPT* 319 

1021 TGCCTGCTCTCTCTGCTTGCTTTCAGAATTGAGAGACGCCCGGAAACACGCAGGTACCAA 1080 

1081 CGCCTGAGAGGGTAACAGTGGGCATGGAGTAGGAAGATGACCAGTGACAGATATGGAGCC 1140 

1141 CATCCAGCTTGTAGACAGCAAATCTGTGATGCCCGAATCCACCCCAGGGTGCAGCTGCCT 1200 

1201 CTAAATACACTTCTTGGCCCAGGACTTGGAGGGAAAAGCGTAGGGACTGGGTCAGCTAGG 1260 

1261 AGGGGTCACAGGCAAGACGCCAGGGAACTGAGGGCATTAGTAGCTGGCTTCTAGGGGTCT 1320 

1321 GTGC AAAGGGG AAC GAAGTGAAGTTAGC AGG AAC TGGTGGG TGG AAGGAAGCTG AATC CT 1380 

1381 GGAGTCACTCAAGGTCTCACAAAGTCAAATAGAGGGCTTACGTGGGAGGGCAGTGGTAGG 1440 

1441 GCTGGGTGAACATCTCATGGTTGAGCATCTCCAAGCATCAGTGAGGCACGGGGGCTGCCC 1500 

1501 TGGAG AAGGTAC ATGGCTGGTGGG ATAGTGGGACTGGCCGGATCCT AC C CGGAGCC AGTC 1560 

1561 TGCAGTGGGAGGGTCGACCTCTTGCTCCAGCCCAGATTTCGTCTTCAGTAACTCATGCTT 1620 

6 2 1 C CTCTCTCCC CC ACC GC ACCCC AGTGG AGGTG ACTCTGGATCCAGAGACGGCTCACCCG A 1680 

1681 AG CTCTGCGTTTCTGATCTGAAAACTGTAAC CCATAGAAAAGCTCCTC A^AGGTGCCTC 1740 

1741 AC TC TGAGAAGAGATTT AC AAGGAAG AGTGTGGTGG CTTCTC AGGGTTTCC AAGC AGGGA 1800 

1801 AACATTACTGGGAGGTGGACGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGG I860 

1861 ATGACGTAGACAGGGGGAAGAACAATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCC 1920 



1 
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1921 TC AGACTGACAACAGAAC ATTTGTATTTC AC ATTCAATCCC C ATTTTATCAGC CTCCCCC 1980 

1981 CCAGCACCCCTCCTACACGAGTAGGGGTCTTCCTGGACTATGAGGGTGGGACCATCTCCT 2040 

2041 TCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGCTGACATGTCAGTTTGAAGGCT 2100 

2101 TGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAGGGGACTCCC ATATTCA 2160 

2161 TATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGCCCC ACACCACA 2220 

2221 GACCCAGACACAGCC AAGGGAGAGTGCTCCCGACAGGTGGCCCC AGCTTCCTCTCCGGAG 2280 

2281 CCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCGT 2340 

2341 GAGCCCTGCAGCAGCGGCAGTCAC AGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTG 2400 

2401 GGAGTCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGT 2460 

2461 TTGTGAAAACTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCA 2520 

2521 TTTGCTAGTCACGGACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATG 2580 

2581 TG AATC ATGC TTGCAGGTT TG AGGGC C AC AGTGTTTGCT AATGG ATGTG TTT TTATGATT 2640 

2641 AT AC ATTTTC C C C AC C AT AAAAC T CTG TTTGCCT T AATTCC C AC ATT AA^T AACTTTTC 2700 

2701 CTCCTATACCCAAATCCACCCATGGAATAGTTAATTGGAACACCTGCCTTTGTGAGGCTC 2760 

2761 CAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCA™ 2820 

2821 CCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCCCTCA 2880 

2881 TT AAC AC AG AC AC AAAAATTCT AAAT AAAAT TTT AACAAATTAAAC TAAAC AAT AT AT TT 2940 

2941 AAAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTCGTTTAATAT 3000 
3001 TTAAATATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAAAAAAAAAAAAAAA 3059 
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1 GGCACGAGGGGGCAGGCATGGGAGCCGCGCGCTCTCTCCCGGCGCCC ACACCTGTCTGAG 6 0 

6 1 CGGCGCAGCGAGCCGCGGCCCGGGCGGGCTGCTCGGCGCGGAACAGTGCTCGGCATGGCA 120 

1 M A 2 

121 GGGATTCCAGGGCTCCTCTTCCTTCTCTTCTTTCTGCTCTGTGCTGTTGGGCAAGTGAGC 180 

3 G 1 PGL LFLLFFLLCAVG Q V S 22 

181 CCTTACAGTGCCCCCTGGAAACCCACTTGGCCTGCATACCGCCTCCCTGTCGTCTTGCCC 240 

23PYSAPWKPTWPAYRLPVVLP 42 

241 CAGTCTACCCTCAATTTAGCCAAGCCAGACTTTGGAGCCGAAGCCAAATTAGAAGTATCT 300 

43QSTLNLA KPDFGAEAKLEVS 62 

3 0 1 TCTTCATGTGGACCCCAGTGTCATAAGGGAACTCCACTGCCCACTTACGAAGAGGCCAAG 360 
63SSCGPQCHKGTPLPTYEEAK 82 

361 C AAT AT C TGTC TT ATG AAACGCTC T ATGCC AATGG C AGC C G C AC AGAG AC GC AGGTGGGC 420 

83QYLSYETLYAN GSRTETQVG 102 

421 ATCTACATCCTCAGCAGTAGTGGAGATGGGGCCCAACACCGAGACTCAGGGTCTTCAGGA 480 

103IYILSSSGDGAQHRDSGSSG 122. 

4 8 1 ^GTCTCGAAGGAAGCGGCAGATTTA^ 540 

142 



123 KSRRKRQIYGYDSRFSI FGK 



541 GACTTCCTGCTCAACTACCCTTTCTCAACATCAGTGAAGTTATCCACGGGCTGCACCGGC 600 

143DFLLNYPFSTSVKLSTGCTg 162 

601 ACCCTGGTGGCAGAGAAGCATGTCCTCACAGCTGCCCACTGCATACACGATGGAAAAACC 660 

163 ^^VAFiKHVT, TAAHCIHn^V>r 182 

661 TATGTGAAAGGAACCCAGAAGCTTCGAGTGGGCTTCCTAAAGCCCAAGTTTAAAGATGGT 720 

183 Y V K G T P KLRVGFLKPKFKDG 202 

721 GGTCGAGGGGCCAACGACTCCACTTCAGCCATGCCCGAGCAGATGAAATTTC 780 

203 GRGANDSTSAMPEQMKFQWI 222 

781 CGGGTGAAACGCACCCATGTGCCCAAGGGTTGGATCAAGGGCAATGCCAATGACATCGGC 840 

223 RVKRTHVPKGWI KGNANDIG 242 
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841 ATGGATT ATG ATT ATGC C C TC CTGG AAC TC AAAAAGC C C C AC AAG AG AAAATTT ATGAAG 900 

243 MDYDYALLELKKPHKRKFMK 262 

901 ATTGGGGTGAGCCCTCCTGCTAAGCAGCTGCCAGGGGGCAGAATTCACTTCTCTGGTTAT 960 

263 IGVS PPAKQLPGGRIHFSGY 282 

961 GACAATGACCGACCAGGCAATTTGGTGTATCGCTTCTGTGACGTCAAAGACGAGACCTAT 1020 

283 DNDRPGNLVYRFCDVKDETY 302 

1021 GACTTGCTCTACCAACAATGCGATTCCCAGCCAGGGGCC AGCGGGTCTGGGGTCTATGTG 1080 

303 DLLYQQCDSQPGASGSGVYV 322 

1081 AGGATGTGGAAG AG AC AAC AC C AG AAGTG GG AGC GG AAAATT ATTGGC ATGATTTC AGGG 1140 

323 RMWKRQHQKWERKIIGMISG 342 

1141 CACCAGTGGGTGGACATGGATGGTTCCCCAC AGGAATTCACACGTGGCTGTTCAGAGATC 1200 

343 HQWVDMDGSPQEFTRGCSEI 362 

1201 ACTCCTCTCCAATATATCCCAGATATATCTATTGGAGTATAAGGAAACTACCTGGATTGT 1260 

363 TPLQYIPDISIGV* 375 

1261 AGGGAGGGGTGACACAGTGTCCCTCCTGCAGCAACTAAGGTCGTCATGTTCTTATTTTAG 1320 

1321 GAGAGGCCAAATTGTTTTTTGTCATTGCCGTGCACACGTGTGTGTGTGTGTGTGTGTGTG 1380 

1381 TGTAAGGTGTCTTATAATCTTTTACCTATTTCTTACAATTGCAAGATGACTGGC 1440 

1441 ATTTGAAAACTGGTTTGTGTATCATATCATATATCATTTAAGCAGTTTGAAGGCATACTT 1500 

1501 TTGCATAGAAATAAAAAAAATACTGATTTGGGGCAATGAGGAATATTTGACA^ 1560 

1561 AATCTTCACGTTTTTGCAAACTTTGATTTTTATTTCATCTGAACTTGT^ 1620 

1621 TATTAAATATTTGGC ATACAAAAAAAAAAAAAAAAAACGGGGGGGCCCGTACCCAATTCG 1680 

1681 CCCTATAGTGAGGCGATAC 1699 
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1 GAATTCGGCAGAGCAGGTGCCCGACATGGCGAGTGTAGTGCTGCCGAGCGGATCCCAGTG 6 0 

1 MASVVLPSGSOC 12 

61 TGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCGGGCTCCGGCTCCGGCTTCTGCTGTTGCT 120 

13 AAAAAAA AP PGLRLRLLL LL 32 

121 CTTCTCCGCCGCGGCACTGATCCCCACAGGTGATGGGCAGAATCTGTTTACGAAAGACGT 180 

33 FSA AALI PTGDG QNLFTKDV 52 

181 GACAGTGATCGAGGGAGAGGTTGCGACC ATCAGTTGCCAAGTCAATAAGAGTGACGACTC 240 

53 TVIEGEVAT I SCQVNKSDDS 72 

241 TGTGATTC AGCTACTGAATCCCMCAGGCAGACCATTTAmCAGGGACTTCAGGCCTTT 300 

73 VIQLLNPNRQTIYFRDFRPL 92 

301 GAAGGACAGCAGGTTTCAGTTGCTGAATTTTTCTAGC AGTGAACTCAAAGTATCATTGAC 360 

93 K0SRFQLLNFSSSELKVSLT 112 

361 AAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGCTCTATACCGATCCCCCACA 420 

113 NVSISDEGRYFCQLYTDPPQ 132 

421 GGAAAGTTACACCACCATCACAGTCCTGGTCCCACCACGTAATCTGATGATCGATATCCA 480 

133 ESYTTITVLVPPRNLMIDIQ 152 

481 G AAAG AC AC TGC GGTGGAAGGTG AGG AG ATTG AAGTC AAC TG C ACTG CT ATGGC C AGC AA 540 

153 KDTAVEGEE IEVNCTAMASK 172 

541 G C C AGC C ACGACT ATC AGGT GGTTC AAAGGGAAC ACAGAGCT AAAAG GC AAATC GGAGGT 600 

173 PATTIRWFKGNTELKGKSEV 192 

601 GGAAGAGTGGTCAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGA 660 

193 EEWSDMYTVTSQLMLKVHKE 212 

661 GGACGATGGGGTCCCAGTGATCTGCCT^GGTGGAGCACCCTGCGGTCACTG^AAACCTGCA 720 

213 DDGVPVI CQVEHPAVTGNLQ 232 

721 GACCCAGCGGTATCTAGAAGTACAGTATAAGCCTCAAGTGCACATTCAGATGACTTATCC 780 

233 TQRYLEVQYKPQVHIQMTYP 252 

781 TCT AC AAGGC TT AAC C CGGGAAGGGGACGC GCTTG AGTT AAC ATGTG AAGC C ATCGGGAA 840 

253 liQGLTREGDALELTCEAIGK 272 
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841 GCCCCAGCCTGTGATGGTAACTTGGGTGAGAGTCGATGATGAAATGCCTCAACACGCCGT 900 
273 PQPVMVTWVRVDDEMPQHAV 292 

901 ACTGTCTGGGCCCAACCTGTTCATCAATAACCTAAACAAAACAGATAATGGTACATACCG 960 
293 LSGPNLFINNLNKTDNGTYR 312 

961 CTGTGAAGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGA 1020 
313 CEASNIVGKAHSDYMLYVYD 332 

1021 TCCCCCCACAACTATCCCTCCTCCCACAACAACCACCACCACCACCACCACCACCACCAC 1080 
333 P P T T I PPPTTTTTTTTTTTT 352 

1081 CACCATCCTTACCATCATCACAGATTCCCGAGCCAGGTGAAGAAGGCTCGATCAGGGCAG 1140 

353 T I L T I I T D S R A R * 355 

1141 TGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTTCGCCATGCTGTGCTTGC 1200 

1201 TCATCATTCTGGGGCGCTATTTTGCCAGACATAAAGGTACATACTTCACTCATGAAGCCA 1260 

1261 AAGGAGCCGATGACGCAGCAGACGCAGACACAGCTATAATCAATGCAGAAGGAGGACAGA 1320 

1321 ACAACTCCGAAGAAAAGAAAGAGTACrrCATCTAGATCAGCCTTTTTGTTTCAATGAGGT 1380 

1381 GTCCAACTGGCCCTATTTAGATCATAAAGAGACAGTGATATTGGAACTTGCGAGAAATTC 1440 

1441 GTGTGTTTTTTTATGAATGGGTGGAAAGGTCTGAGACTGGGAAGGCTTOKATTTGCTC 1500 
1501 GTAAAAAAAAAAAAAAAAAA 1520 
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1 GGCACGAGCCGGACCCTGCCGCCCTGCCACTATGTCCCGCCGCTCTATGCTGCTTGCCTG 6 0 

1 MSRRSMLLAW 10 

6 1 GGCTCTCCCCAGCCTCCTTCGACTCGGAGCGGCTCAGGAGACAGAAGACCCGGCCTGCTG 120 

11 ALPSLLRLGAA QETEDPACC 30 

121 C AGCCC C ATAGTGC CCCGGAACGAGTGGAAGGCC CTGGCATC AGAGTGCGCCC AGCACCT 180 

31 S P I VPRMEWK ALASECAOHT. 50 

181 GAGCCTGCCCTTACGCTATGTGGTGGTATCGCACACGGCGGGCAGCAGCTGCAACACCCC 240 

51 SLPLRYVVVSHT AGSSCMTP 70 

241 CGCCTCGTGCCAGCAGCAGGCCCGGAATGTGCAGCACTACCACATGAAGACACTGGGCTG 300 

71 ASCOOOAPNVnH YHMKTT, GW 90 

3 01 GTGCGACGTGGGCTACAACTTCCTGATTGGAGAAGACGGGCTCGTATACGAGGGCCGTGG 360 

91 CDVGYMFT. TGRn GLVYISGRC U0 

361 CTGGAACTTCACGGGTGCCCACTCAGGTCACTTATGGAACCCCATGTCC ATTGGCATCAG 420 

111 W N F T G A H SGHLWNPMSIGIS 130 

421 CTTCATGGGC AACTACATGGATCGGGTGCCC ACACCCCAGGCC ATCC GGGCAGC CCAGGG 480 

131 FMGNYMDRVPTPQAI RAAQG 150 

481 TCTACTGGCCTGCGGTGTGGCTCAGGGAGCCCTGAGGTCCAACTATGTGCTCAAAGGACA 54 0 

151 LLACGVAQGALRSNYVLKGH 170 

541 CCGGGATGTGCAGCGTACACTCTCTCCAGGCAACCAGCTCTACCACCTCATCCAGAATTG 600 

171 RDVQRTLS PGNQLYHLIQNW 190 

601 GCCACACTACCGCTCCCCCTGAGGCCCTGCTGATCCGCACCCCATTCCTCCCCTCCCATG 660 

191 P H Y R S P * 197 

661 GCCAAAAACCCCACTGTCTCCTTCTCCAATAAAGATGTAGCTCAAAAAAAAAAAAAAAAA 720 

721 AAAAAA 726 
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Decoration •Decoration #1': Shade (with solid black) residues that natch the Consensus 
exactly. 
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■ Alpha, Regions - Gamier-Robson 
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B Hydropttfcity Plot - Kyte-Doolirtle 



□ Hydropnobfcity Plot • Hopp-Woods 
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<110> Human Genome Sciences, Inc, 



<120> 12 Human Secreted Proteins 



<130> PF489 



<140> Unassigned 
<141> 1999-10-26 



<150> 60/105,971 
<151> 1998-10-28 



<160> 147 



<170> Patentln Ver. 2.0 



<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 



<400> 1 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 

acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

gactctagag gat 733 



<210> 2 

<211> 5 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> Site 
<222> (3) 

<223> Xaa equals any of the twenty naturally ocurring L-amino acids 
<400> 2 

Trp Ser Xaa Trp Ser 
1 5 



<210> 3 

<211> 86 

<212> DNA 

<213> Homo sapiens 
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<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 86 

<210> 4 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 27 

<210> 5 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 

aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 

gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 

ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 6 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 

<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 

<210> 8 

<211> 12 

<212> DNA 

<213> Homo sapiens 

<400> 8 
ggggactttc cc 

<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 
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<400> 9 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 7 o 



<210> 10 

<211> 256 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60 

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120 

■cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180 

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240 

cttttgcaaa aagctt 5 c fi 



<210> 11 

<211> 2609 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gcgtccgggc caaaatgctg agaacgtcca ctcctaatct gtgtggtggt ctgcattgcc 60 

gggccccctg gctctcttct ggcattctct gcctctgcct catattcttg ttaggccagg 120 

tgggcttgct gcagggacac ccccagtgcc tggattacgg gccccctttc cagccccctc 180 

tgcaccttga gttttgctct gactatgagt ccttcggctg ctgtgatcag cacaaggacc 240 

gccgcatcgc tgcccggtac tgggacatca tggaatattt tgatctgaag agacatgagc 300 

tgtgtggaga ttacattaaa gacatccttt gccaggagtg ctcgccctac gcagcccact 360 

ctacgacgcc gaaaacaccc agacgcctct ccggaatctc ccgggcctct gctctgatta 420 

ctgctctgcc ttccattcta actgtcactc agccatttcc ctgctgacca atgaccgcgg 480 

cctccaggag tctcatggaa gggacggtac ccgcttctgc cacctcctgg accttcctga 540 

caaggactat tgcttcccta atgtcctgag gaacgactat ctcaaccgcc acctgggcat 600 

ggtggcccaa gatcctcagg gctgcctgca gctctgcctg agcgaggtgg ccaacgggct 660 

gaggaacccc gtctccatgg tccatgctgg ggacggcacc catcgcttct ttgttgccga 720 

gcaggtagga gtggtgtggg tctacctccc tgatgggagt cgcctggagc aacccttcct 780 

ggacctcaag aacatcgtgt tgaccacccc atggatcggg gatgagagag gcttcttggg 840 

gttggctttt caccccaaat tccgccacaa tcgcaagttc tatatttatt attcgtgcct 900 

ggacaagaag aaggtagaaa agatccgaat tagtgagatg aaggtttctc gggctgatcc 960 

taacaaagct gacctgaaat cagagagggt catcttggag attgaagaac cagcctcaaa 1020 

ccataatggc ggacaacttc tttttggcct ggatggctat atgtacatat tcactgggga 1080 

cgggggacag gctggagatc cctttggcct gtttggaaat gctcagaaca aaagttccct 1140 

gctgggaaaa gttttaagga tcgatgtgaa cagggcaggc tcacatggca agcggtaccg 1200 

agtcccctcg gacaatccat ttgtttctga gccaggggcc caccccgcca tctatgccta 1260 

tgggatcagg aacatgtggc gttgtgctgt ggaccgaggg gaccccatca cgcgccaggg 1320 

ccgaggccgg atattctgtg gggacgtggg ccagaacagg tttgaagagg ttgacctcat 1380 

tttgaaaggt ggaaactatg gctggagagc aaaggaaggg tttgcatgtt atgacaaaaa 1440 

actttgtcac aatgcctctt tggatgatgt tctgccaatc tatgcttatg gccatgcagt 1500 

ggggaagtca gtcactggag gttatgtcta tcgtggttgt gaatccccaa atctcaatgg 1560 

cctgtatatc tttggagact tcatgagtgg tcgacttatg gctttgcagg aagatagaaa 1620 

aaacaagaaa tggaagaagc aggatctttg cctgggcagc accacgtcct gtgccttccc 1680 

agggctgatc agcacccata gcaagttcat catctccttt gctgaagatg aagcagggga 1740 

gctgtatttc ctggcgacct cttacccaag tgcctatgca ccacgtggat ctatttacaa 1800 

gtttgttgac ccctcaaggc gagcaccccc aggcaagtgc aaatacaagc cagtgcccgt 1860 

gagaaccaag agtaagcgga tcccgttcag accactcgcc aagacagtct tggacttgct 1920 

aaaggaacaa tcagagaaag ctgctagaaa atcttccagt gcaaccttag cttctggccc 1980 
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agcccagggt ttgtctgaga aaggctcctc caagaagctg gcttctccta caagcagcaa 2040 

gaatacattg cgagggcctg gtacaaagaa gaaagccaga gtggggcccc acgtccgcca 2100 

gggcaagagg aggaagagcc tgaaaagcca cagtggcagg atgaggccat cagcagagca 2160 

gaagcgagct ggcagaagtc tcccttgacc tattggtcaa ggtggccgac agggtgacgt 2220 

gagagaggag agccacctca tcaaatgaaa gtcactgctg aataaagacc ttagaagtct 2280 

gggaagccag ggtagaggtg gggcagggcg gttttcctct ccctgggaaa tcttgctgtc 2340 

tactgaataa ataaatgcac cttctctgta tgcagtgctt ctgtgggaga ccatatccca 2400 

gattgctggt gcacctgggt tatggtaagc actatccatg agcctgcttg gaatcacact 2460 

ggatgtctcc gttttgtctt gtaaatgcct acaacctgag gtaataaatc aacatttgct 2520 

caaactggca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 9fiHQ 



<210> 12 

<211> 2499 

<212> DNA 

<213> Homo sapiens 

<400> 12 

caccagcacc ccgcccagag cagtgccgct gcccaaatcc tcgcaggcag ctcatcaacg 60 

caattgcaac tccggctgga gccccggacc tgcaagcctg ggtgtccgtg ggtccgtctg 120 

cccagccatc tgctggtggc acctctccct cctgccgcct ccctcggtga accccacctt 180 

gcagaagtgc agctcgcccg gagcagccca ggagctcagc atgcgtcccc caggcttcag 240 

gaacttcttg ctgctggcgt cctcccttct ctttgctggg ttgtcagctg ttcctcaaag 300 

cttctcgcca tctctgagga gctggccggg cgccgcctgc aggctgtccc gggccgagtc 360 

ggagcgacgc tgccgcgcac ctgggcagcc cccgggggcc gcgctgtgcc acggccgggg 420 

ccgctgcgac tgcggcgtct gcatctgcca cgtgactgag ccgggcatgt tcttcgggcc 480 

cctgtgtgag tgccatgagt gggtgtgcga gacctacgac gggagcacct gtgcaggcca 540 

tggtaagtgt gactgtggca agtgcaagtg tgaccaggga tggtatgggg atgcttgcca 600 

gtacccaact aactgtgact tgacaaagaa gaaaagtaac caaatgtgca agaattcaca 660 

agacatcatc tgctctaatg caggtacatg tcactgtggc aggtgtaagt gtgataattc 720 

agatggaagt ggacttgtgt atggtaaatt ttgtgagtgt gacgatagag aatgcataga 780 

cgatgaaaca gaagaaatat gtggaggcca tgggaagtgt tactgtggaa actgctactg 840 

caaggctggt tggcatggag ataaatgtga attccagtgc gatatcaccc cctgggaaag 900 

caagcgaaga tgcacgtctc cagatggcaa aatctgcagt aacagaggga cttgtgtatg 960 

tggtgaatgt acctgtcacg atgttgatcc gactggggac tggggagata ttcatgggga 1020 

cacctgtgaa tgtgatgaga gggactgtag agctgtctat gaccgatatt ctgatgactt 1080 

ctgttcaggt catggacagt gtaattgcgg aagatgtgac tgcaaagcag gctggtatgg 1140 

gaagaagtgt gagcacccac agtcctgcac gctgtcagct gaggagagca tcaggaagtg 1200 

ccagggaagc tcggatctgc cttgctctgg gaggggtaaa tgtgaatgtg gcaaatgcac 1260 

ctgctatcct ccaggagatc gccgggtgta tggcaagact tgtgagtgtg atgatcgccg 1320 

ctgtgaagac ctcgatggtg tggtctgtgg aggccacggc acatgttcct gtggtcgctg 13 80 

tgtttgtgag agaggatggt ttggaaagct ctgccaacat ccgcggaagt gtaacatgac 1440 

ggaagaacaa agcaagaatc tgtgtgaatc agcagatggc atattgtgct cggggaaggg 1500 

ttcttgtcat tgtgggaagt gcatttgttc tgctgaagag tggtatattt ctggggagtt 1560 

ctgtgactgt gatgacagag actgcgacaa acatgatggt ctcatttgta cagggaatgg 1620 

aatatgtagc tgtggaaact gtgaatgctg ggatggatgg aatggaaatg catgtgaaat 1680 

ctggcttggc tcagaatatc cttaacaatt acatgagaga ggtctggatt cttatttttt 1740 

ctgggccatt agaacatata aatgcgaagg aaaccatgta tattcaccac taggacaggt 1800 

taaaaagacc attgtatgtt tttctatttc tgaattacga atgaaatccg agtacctatt 1860 

agaaatgagt tatgcaaatt tagatgcaaa taacattaga aaaaaaagat tcttccataa 1920 

ttaacataag tggttcctaa cgagagcaat ttttccaccc aaaagtcatt tggcaacatc 1980 

tacagacaat tttgattgtc acactgggtc gggtaggaag gtatgctgca gacatttggt 2040 

gggtagaggc cagggatgct gctgagcatc ccgcagtgta caggacagcc cccaaacaag 2100 

gaattatcca gccccaaatg ccaatagggc tcagactgag aaacattgag ttatatggct 2160 

attagaaatc cacattctta cacaagaaag accatattag aatctaagga aaacatgcat 2220 

attcacatta attaatcgat cagatttttc cagaattccg tatcagtcac cattttaata 2280 

tggggacaat gaagacaagc acacaggagg tagaatatca gagtggggct ggatcaaggg 2340 
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caaaaactgg tcattaagtc atctgacatt aaatcattta gccactaagt tatttgtgta 2400 

ctctcacttt aaactcacca aagaagattc tcttaaagaa attatgaaaa atgtacaatt 2460 

taacatttta aataaatagt gacagaagtt gtttaaaaa 2499 



<210> 13 

<211> 1339 

<212> DNA 

<213> Homo sapiens 



<400> 13 

ggcacgagag cgtgacccag ctgcggccgg ccagccatgg agactggagc gctgcggcgc 60 

ccgcaacttc tcccgttgct gctgctgctc tgcggtgggt gtcccagagc aggcggctgc 120 

aacgagacag gcatgttgga gaggctgccc ctgtgtggga aggctttcgc agacatgatg 180 

ggcaaggtgg acgtctggaa gtggtgcaac ctgtccgagt tcatcgtgta ctatgagagt 240 
ttcaccaact gcaccgagat ggaggccaat gtcgtgggct gctactggcc caaccccctg 
gcccagggct tcatcaccgg catccacagg cagttcttct ccaactgcac cgtggacagg 
"gtccacttgg aggacccccc agacgaggtt ctcatcccgc tgatcgttat acccgtcgtt 
"ctgactgtcg ccatggctgg cctggtggtg tggcgcagca aacgcaccga cacgctgctg 

tgagggtccc ggtgagatgg agtgggtcac acctggcaag ctggaagaaa gttccctggg 540 

gatgggagag cgggtgggtg ctgccaatct ccagctactg tggccacacc ccacctggtc 600 

atgggcagac ccctcccttc ctgggctgac ctgctccctc gaggccagcc tgctccctgg 660 

ctgaggctca ggctatccgc ccaagctctt tgctcattct agggccagtg gaggaaaatg 720 

tgataaggcc agagcttgtg tgctgggcac agaaatcacc tgctgcatcc tgtgctccgc 780 
agctgggccg gacctctgcc cgcaggtttc tatgctgttt cttagcacag aatccagcct 
agccttagcc gcagtctaag ccctgcttgg actaggactc cttgcttgac cccatctctg 
gttcctgccc tggctcctgc accagcccca gctcctgcct acatccaggc agaaagatag 

caggggctct tggaagacgt tccgtgctgt gacctccgag ccctcctggt gggaagacag 1020 

ctggaaaggc tgggaggaga agggaggggt tgggggttcc caggagccat gcgtggcctg 1080 

cagagtccat tccatcatga tgctgtgccc gctatgggct gtgtccatga ccagaggctg 1140 

gagtgggggt gtgttagagc ccctcaccgg gacttgctgt gcggatgggg cctgggcctc 1200 

cttcctacag gggctcctct gtgggtgagg ggccctctgg aatggcatcc catgagcttg 1260 

tggcctctat ctgctaccat ctgtgtttta tctgagtaaa gttaccttac ttctggaaaa 1320 

aaaaaaaaaa aaaaaaaaa 1339 



300 
360 
420 
480 



840 
900 
960 



<210> 14 
<211> 1389 
~'«212> DNA 
'<213> Homo sapiens 

v;t<400> 14 

ggcacgagga cagcctccac cagagtcccc acctttctgg aagctgcagg gctctccatc ' 60 

caggatccag aagcattgaa ggggaccagc cgctgaaggg attctcagtc ccatctgact 120 

ccccatgagg ctcctggctt tcctgagtct gctggccttg gtgctgcagg agacagggac 180 

agcttctctc ccaaggaagg agaggaagag gagagaggag cagatgccca gggaaggcga 240 

ttcctttgaa gttctgcctc tgcggaatga tgtcctgaac ccagacaact atggtgaagt 300 

cattgacctg agcaactatg aggagctcac agattatggg gaccaactcc ccgaggttaa 360 

ggtgactagc ctcgctcctg caaccagcat cagtcccgcc aagagcacta cggctccagg 420 

gacaccctcg tcaaacccca cgatgaccag acctactaca gcagggctgc tactgagttc 480 

ccagcccaac catggtctgc ccacctgcct ggtctgcgtg tgcctcggtt cctctgtgta 540 

ttgcgatgac attgacctag aggacattcc tcctcttcct cggaggactg cctacctgta 600 

tgcacgcttc aaccgcatca gccgtatcag ggccgaagac ttcaaagggc tgacaaagtt 660 

gaagaggatt gacctctcca acaacctcat ttcctccatc gataatgatg ccttccgcct 720 

gctacatgcc ctccaggacc tcatcctccc agagaaccag ttggaagctc tgcccgtgct 780 

gcccagtggc attgagttcc tggatgtccg cctaaatcgg ctccagagct cggggataca 840 

gcctgcagcc ttcagggcaa tggagaagct gcagttcctt tacctgtcag acaacctgct 900 

ggattctatc ccggggcctt tgcccccgag cctgcgctct gtacacctgc agaataacct 960 
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gatagagacc atgcagagag acgtcttctg tgaccccgag gagcacaaac acacccgcag 1020 

gcagctggaa gacatccgcc tggatggcaa ccccatcaac ctcagcctct tccccagcgc 1080 

ctacttctgc ctgcctcggc tccccatcgg ccgcttcacg tagctcggag cccttccact 1140 

cctcccaggt catctcttgg accagcgggc atcacattct ccagcagccg ccatctcaca 1200 

cgcctccctc ctgtggccgc cggcagcatg gacaaaggtc tccatgcagg gggaggaggc 1260 

ctgcttcttt ccccacagct ctcacgtctc ccttctccct gcgggtgaca aagaagccca 1320 

aggaccacct ccttcctgcc tcattgtaat aaaattcccc acactgaaaa aaaaaaaaaa 1380 

aaaaaaaaa 1389 



<210> 15 

<211> 2295 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ccacgcgtcc gagagaacag gcctgtctca ggcaggccct gcgcctccta tgcggagatg 60 

ctactgccac tgctgctgtc ctcgctgctg ggcgggtccc aggctatgga tgggagattc 120 

tggatacgag tgcaggagtc agtgatggtg ccggaggcct gtgacatctc tgtgccctgc 180 

tctttctcct acccccgaca agactggaca gggtctaccc cagcttatgg ctactggttc 240 

aaagcagtga ctgagacaac caagggtgct cctgtggcca caaaccacca gagtcgagag 300 

gtggaaatga gcacccgggg ccgattccag ctcactgggg accccgccaa ggggaactgc 360 

tccttggtga tcagagacgc gcagatgcag gatgagtcac agtacttctt tcgggtggag 420 

agaggaagct atgtgagata taatttcatg aacgatgggt tctttctaaa agtaacagtg 480 

ctcagcttca cgcccagacc ccaggaccac aacaccgacc tcacctgcca tgtggacttc 540 

tccagaaagg gtgtgagcgc acagaggacc gtccgactcc gtgtggccta tgcccccaga 600 

gaccttgtta tcagcatttc acgtgacaac acgccagccc tggagcccca gccccaggga 660 

aatgtcccat acctggaagc ccaaaaaggc cagttcctgc ggctcctctg tgctgctgac 720 

agccagcccc ctgccacact gagctgggtc ctgcagaaca gagtcctctc ctcgtcccat 780 

ccctggggcc ctagacccct ggggctggag ctgcccgggg tgaaggctgg ggattcaggg 840 

cgctacacct gccgagcgga gaacaggctt ggctcccagc agcgagccct ggacctctct 900 

gtgcagtatc ctccagagaa cctgagagtg atggtttccc aagcaaacag gacagtcctg 960 

gaaaaccttg ggaacggcac gtctctccca gtactggagg gccaaagcct gtgcctggtc 1020 

tgtgtcacac acagcagccc cccagccagg ctgagctgga cccagagggg acaggttctg 1080 

agcccctccc agccctcaga ccccggggtc ctggagctgc ctcgggttca agtggagcac 1140 

gaaggagagt tcacctgcca cgctcggcac ccactgggct cccagcacgt ctctctcagc 1200 

ctctccgtgc actactcccc gaagctgctg ggcccctcct gctcctggga ggctgagggt 1260 

ctgcactgca gctgctcctc ccaggccagc ccggccccct ctctgcgctg gtggcttggg 1320 

gaggagctgc tggaggggaa cagcagccag gactccttcg aggtcacccc cagctcagcc 1380 

gggccctggg ccaacagctc cctgagcctc catggagggc tcagctccgg cctcaggctc 1440 

cgctgtgagg cctggaacgt ccatggggcc cagagtggat ccatcctgca gctgccagat 1500 

aagaagggac tcatctcaac ggcattctcc aacggagcgt ttctgggaat cggcatcacg 1560 

gctcttcttt tcctctgcct ggccctgatc atcatgaaga ttctaccgaa gagacggact 1620 

cagacagaaa ccccgaggcc caggttctcc cggcacagca cgatcctgga ttacatcaat 1680 

gtggtcccga cggctggccc cctggctcag aagcggaatc agaaagccac accaaacagt 1740 

cctcggaccc ctcttccacc aggtgctccc tccccagaat caaagaagaa ccagaaaaag 1800 

cagtatcagt tgcccagttt cccagaaccc aaatcatcca ctcaagcccc agaatcccag 1860 

gagagccaag aggagctcca ttatgccacg ctcaacttcc caggcgtcag acccaggcct 1920 

gaggcccgga tgcccaaggg cacccaggcg gattatgcag aagtcaagtt ccaatgaggg 1980 

tctcttaggc tttaggactg ggacttcggc taggaggaag gtagagtaga agttgaagat 2040 

aaccgagtgc aagagtttcc ttctctccct ctctctctct ctttctctct ctctctctct 2100 

ttctctctct ttttaaaaaa actctggcca cggcacagtg gctcacgcct gtaatcccag 2160 
cactttggga agttgaagtt gggccaaatc ccctgagtcc ggattcgaaa acagcctggc . 2220 

caacttgtga aaacccgtct ctacttaaaa atacccaaat tacctgggca ttgtggcagg 2280 

ggcctgttta tcctt 2295 



<210> 16 
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<211> 1748 

<212> DNA 

<213> Homo sapiens 

<400> 16 

agacgttccc tcgcggccct ggcacctcca accccagata tgctgctgct gctgctgctg 60 

cccctgctct gggggaggga gagggtggaa ggacagaaga gtaaccggaa ggattactcg 120 

ctgacgatgc agagttccgt gaccgtgcaa gagggcatgt gtgtccatgt gcgctgctcc 180 

ttctcctacc cagtggacag ccagactgac tctgacccag ttcatggcta ctggttccgg 240 

gcagggaatg atataagctg gaaggctcca gtggccacaa acaacccagc ttgggcagtg 300 

caggaggaaa ctcgggaccg attccacctc cttggggacc cacagaccaa aaattgcacc 360 

ctgagcatca gagatgccag aatgagtgat gcggggagat acttctttcg tatggagaaa 420 

ggaaatataa aatggaatta taaatatgac cagctctctg tgaacgtgac agccttgacc 480 

cacaggccca acatccttat ccccggtacc ctggagtctg gctgcttcca gaatctgacc 540 

tgctctgtgc cctgggcctg tgagcagggg acgcccccta tgatctcctg gatggggacc 600 

tctgtgtccc ccctgcaccc ctccaccacc cgctcctcag tgctcaccct catcccacag 660 

ccccagcacc acggcaccag cctcacctgt caggtgacct tgcctggggc cggcgtgacc 720 

acgaacagga ccatccaact caatgtgtcc taccctcctc agaacttgac tgtgactgtc 780 

ttccaaggag aaggcacagc atccacagct ctggggaaca gctcatctct ttcagtccta 840 

gagggccagt ctctgcgctt ggtctgtgct gttgacagca atccccctgc caggctgagc 900 

tggacctgga ggagtctgac cctgtacccc tcacagccct caaaccctct ggtactggag 960 

ctgcaagtgc acctggggga tgaaggggaa ttcacctgtc gagctcagaa ctctctgggt 1020 

tcccagcacg tttccctgaa cctctccctg caacaggagt acacaggcaa aatgaggcct 1080 

gtatcaggag tgttgctggg ggcggtcggg ggagctggag ccacagccct ggtcttcctc 1140 

tccttctgtg tcatcttcat tgtagtgagg tcctgcagga agaaatcggc aaggccagca 1200 

gcggacgtgg gagacatagg catgaaggat gcaaacacca tcaggggctc agcctctcag 1260 

ggtaacctga ctgagtcctg ggcagatgat aacccccgac accatggcct ggctgcccac 1320 

tcctcagggg aggaaagaga gatccagtat gcacccctca gctttcataa gggggagcct 1380 

caggacctat caggtcaaga agccaccaac aatgagtact cagagatcaa gatccccaag 1440 

taagaaaatg cagaggctcg ggcttgtttg agggttcacg acccctccag caaaggagtc 1500 

tgaggctgat tccagtagaa ttagcagccc tcaatgctgt gcaacaagac atcagaactt 1560 

attcctcttg tctaactgaa aatgcatgcc tgatgaccaa actctccctt nccccatcca 1620 

atcggtccac actccccgcc ctggcctctg gtacccacca ttctcctctg tacttctcta 1680 

aggatgacta ctttagattc cgaatatagt gagattgtaa cgtgaaaaaa aaaaaaaaaa 1740 

aaaaaaaa 1748 



, <210> 17 
<211> 4995 
<212> DNA 
<213> Homo sapiens 

<400> 17 

gccgcgccga ggaggctgcc gctctggctt gccgcccccc gccgccgctg cacaccggac 60 

ccagccgccg tgccgcgggc catggacctg cccaggggcc tggtggtggc ctgggcgctc 120 

agcctgtggc cagggttcac ggacaccttc aacatggaca ccaggaagcc ccgggtcatc 180 

cctggctcca ggaccgcctt ctttggctac acagtgcagc agcacgacat cagtggcaat 240 

aagtggctgg tcgtgggcgc cccactggaa accaatggct accagaagac gggagacgtg 300 

tacaagtgtc cagtgatcca cgggaactgc accaaactca acctgggaag ggtcaccctg 360 

tccaacgtgt ccgagcggaa agacaacatg cgcctcggcc ttagtctcgc caccaacccc 420 

aaggacaaca gcttcctggc ctgcagcccc ctctggtctc atgagtgtgg gagctcctac 480 

tacaccacag ggatgtgttc aagagtcaac tccaacttca ggttctccaa gaccgtggcc 540 

ccagctctcc aaaggtgcca gacctacatg gacatcgtca ttgtcctgga tggctccaac 600 

agcatctacc cctgggtgga ggttcagcac ttcctcatca acatcctgaa aaagttttac 660 

attggcccag ggcagatcca ggttggagtt gtgcagtatg gcgaagatgt ggtgcatgag 720 

tttcacctca atgactacag gtctgtaaaa gatgtggtgg aagctgccag ccacattgag 780 

cagagaggag gaacagagac ccggacggca tttggcattg aatttgcacg ctcagaggct 840 

ttccagaagg gtggaaggaa aggagccaag aaggtgatga ttgtcatcac agatggggag 900 
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tcccacgaca gcccagacct ggagaaggtg atccagcaaa gcgaaagaga caacgtaaca 960 

agatatgcgg tggccgtcct gggctactac aaccgcaggg ggatcaatcc agaaaccttt 1020 

ctaaatgaaa tcaaatacat cgccagtgac cctgatgaca agcacttctt caatgtcact 1080 

gatgaggctg ccttgaagga cattgtcgat gccctggggg acagaatctt cagcctggaa 1140 

ggcaccaaca agaacgagac ctcctttggg ctggagatgt cacagacggg cttttcctcg 1200 

cacgtggtgg aggatggggt tctgctggga gccgtcggtg cctatgactg gaatggagct 1260 

gtgctaaagg agacgagtgc cgggaaggtc attcctctcc gcgagtccta cctgaaagag 1320 

ttccccgagg agctcaagaa ccatggtgca tacctggggt acacagtcac atcggtcgtg 1380 

tcctccaggc aggggcgagt gtacgtggcc ggagcccccc ggttcaacca cacgggcaag 1440 

gtcatcctgt tcaccatgca caacaaccgg agcctcacca tccaccaggc tatgcggggc 1500 

cagcagatag gctcttactt tgggagtgaa atcacctcgg tggacatcga cggcgacggc 1560 

gtgactgatg tcctgctggt gggcgcaccc atgtacttca acgagggccg tgagcgaggc 1620 

aaggtgtacg tctatgagct gagacagaac cggtttgttt ataacggaac gctaaaggat 1680 

tcacacagtt accagaatgc ccgatttggg tcctccattg cctcagttcg agacctcaac 1740 

caggattcct acaatgacgt ggtggtggga gcccccctgg aggacaacca cgcaggagcc 1800 

atctacatct tccacggctt ccgaggcagc atcctgaaga cacctaagca gagaatcaca 1860 

gcctcagagc tggctaccgg cctccagtat tttggctgca gcatccacgg gcaattggac 1920 

ctcaatgagg atgggctcat cgacctggca gtgggagccc ttggcaacgc tgtgattctg 1980 

tggtcccgcc cagtggttca gatcaatgcc agcctccact ttgagccatc caagatcaac 2040 

atcttccaca gagactgcaa gcgcagtggc agggatgcca cctgcctggc cgccttcctc 2100 

tgcttcacgc ccatcttcct ggcaccccat ttccaaacaa caactgttgg catcagatac 2160 

aacgccacca tggatgagag gcggtataca ccgagggccc acctggacga gggcggggac 2220 

cgattcacca acagagccgt actgctctcc cccggccagg agctctgtga gcggatcaac 2280 

ttccatgtcc tggacactgc tgactacgtg aagccagtga ccttctcagt cgagtattcc 2340 

ctggaggacc ctgaccatgg ccccatgctg gacgacggct ggcccaccac tctcagagtc 2400 

tcggtgccct tctggaacgg ctgcaatgag gatgagcact gtgtccctga ccttgtgttg 2460 

gatgcccgga gtgacctgcc cacggccatg gagtactgcc agagggtgct gaggaagcct 2520 

gcgcaggact gctccgcata cacgctgtcc ttcgacacca cagtcttcat catagagagc 2580 

acacgccagc gagtggcggt ggaggccaca ctggagaaca ggggcgagaa cgcctacagc 2640 

acggtcctaa atatctcgca gtcagcaaac ctgcagtttg ccagcttgat ccagaaggag 2700 

gactcagacg gtagcattga gtgtgtgaac gaggagagga ggctccagaa gcaagtctgc 2760 

aacgtcagct atcccttctt ccgggccaag gccaaggtgg ctttccgtct tgattttgag 2820 

ttcagcaaat ccatcttcct acaccacctg gagatcgagc tcgctgcagg cagtgacagt 2880 

aatgagcggg acagcaccaa ggaagacaac gtggccccct tacgcttcca cctcaaatac 2940 

gaggctgacg tcctcttcac caggagcagc agcctgagcc actacgaggt caagctcaac 3000 

agctcgctgg agagatacga tggtatcggg cctcccttca gctgcatctt caggatccag 3060 

aacttgggct tgttccccat ccacgggatt atgatgaaga tcaccattcc catcgccacc 3120 

aggagcggca accgcctact gaagctgagg gacttcctca cggacgaggt agcgaacacg 3180 

tcctgtaaca tctggggcaa tagcactgag taccggccca ccccagtgga ggaagacttg 3240 

cgtcgtgctc cacagctgaa tcacagcaac tctgatgtcg tctccatcaa ctgcaatata 3300 

cggctggtcc ccaaccagga aatcaatttc catctactgg ggaacctgtg gttgaggtcc 3360 

ctaaaagcac tcaagtacaa atccatgaaa atcatggtca acgcagcctt gcagaggcag 3420 

ttccacagcc ccttcatctt ccgtgaggag gatcccagcc gccagatcgt gtttgagatc 3480 

tccaagcaag aggactggca ggtccccatc tggatcattg taggcagcac cctggggggc 3540 

ctcctactgc tggccctgct ggtcctggca ctgtggaagc tcggcttctt tagaagtgcc 3600 

aggcgcagga gggagcctgg tctggacccc acccccaaag tgctggagtg aggctccaga 3660 

ggagactttg agttgatggg ggccaggaca ccagtccagg tagtgttgag acccaggcct 3720 

gtggccccac cgagctggag cggagaggaa gccagctggc tttgcacttg acctcatctc 3780 

ccgagcaatg gcgcctgctc cctccagaat ggaactcaag ctggttttaa gtggaactgc 3840 

cctactggga gactgggaca cctttaacac agacccctag ggatttaaag ggacacccct 3900 

acacacaccc aggcccacgc caaggcctcc ctcaggctct gtggagggca tttgctgccc 3960 

cagctactaa ggtgctagga attcgtaatc atccccatcc tccagagaaa cccagggagg 4020 

aagactgtaa atacgaaccc aatctgcaca ctccaggcct ctagttccag aaggatccaa 4080 

gacaaaacag atctgaattc tgcccttttc tctcacccat cccacccctc cattggctcc 4140 

caagtcacac ccactccctt ccccatagat aggcccctgg ggctcccgaa gaatgaaccc 4200 

aagagcaagg gcttgatggt gacagctgca agccagggat gaagaaagac tctgagatgt 4260 

ggagactgat ggccaggcaa gtgggaccag gatactggac gctgtcctga gatgagaggt 4320 

agccgggctc tgcacccacg tgcattcaca ttgaccgcaa ctcacacatt cccccaccag 4380 
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ctgcagcccc ttgctctcag ctgccaaccc tcccgggtca cttttgttcc caggtacctc 4440 

atgggaagca tgtggatgac acaatccctg gggctgtgca ttcccacgtc ttcttgctgc 4500 

agcctgcccc tagacatgga cgcaccggcc tggctgcagc tgggcagcag gggtaggggt 4560 

agggagcctc ccctccctgt atcaccccct ccctacacac acacacacac acacacacac 4620 

acactgcctc ccatccttcc ctcatgcccg ccagtgcaca gggaagggct tggccagcgc 4680 

tgttgagggg tcccctctgg aatgcactga ataaagcacg tgcaaggact cccggagcct 4740 

gtgcagcctt ggtggcaaat atctcatctg ccggccccca ggacaagtgg tatgaccagt 4800 

gataatgccc caaggacaag gggcgtgcct ggcgcccagt ggagtaattt atgccttagt 4860 

cttgttttga ggtagaaatg caagggggac acatgaaagg catcagtccc cctgtgcata 4920 

gtacgacctt tactgtcgta tttttgaaaa attaaaaata cagtgtttaa aaacaaaaaa 4980 

aaaaaaaaaa aaaaa yiooc 



<210> 18 
^<211> 726 
"<212> DNA 

<213> Homo sapiens 

<400> 18 



ggcacgagcc ggaccctgcc gccctgccac tatgtcccgc cgctctatgc tgcttgcctg 60 
ggctctcccc agcctccttc gactcggagc ggctcaggag acagaagacc cggcctgctg 120 
cagccccata gtgccccgga acgagtggaa ggcccxggca tcagagtgcg cccagcacct 



- -- ~~ 180 

gagcctgccc ttacgctatg tggtggtatc gcacacggcg ggcagcagct gcaacacccc 240 

cgcctcgtgc cagcagcagg cccggaatgt gcagcactac cacatgaaga cactgggctg 300 

gtgcgacgtg ggctacaact tcctgattgg agaagacggg ctcgtatacg agggccgtgg 360 

ctggaacttc acgggtgccc actcaggtca cttatggaac cccatgtcca ttggcatcag 420 

cttcatgggc aactacatgg atcgggtgcc cacaccccag gccatccggg cagcccaggg 480 

tctactggcc tgcggtgtgg ctcagggagc cctgaggtcc aactatgtgc tcaaaggaca 540 

ccgggatgtg cagcgtacac tctctccagg caaccagctc taccacctca tccagaattg 600 

gccacactac cgctccccct gaggccctgc tgatccgcac cccattcctc ccctcccatg 660 

gccaaaaacc ccactgtctc cttctccaat aaagatgtag ctcaaaaaaa aaaaaaaaaa 720 

aaaaaa 726 



<210> 19 

<211> 3059 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ggcacgagct gtcatccgtt tccatgccgt gaggtccatt cacagaacac atccatggct 60 

ctcatgctca gtttggttct gagtctcctc aagctgggat cagggcagtg gcaggtgttt 120 

gggccagaca agcctgtcca ggccttggtg ggggaggacg cagcattctc ctgtttcctg 180 

tctcctaaga ccaatgcaga ggccatggaa gtgcggttct tcaggggcca gttctctagc 240 

gtggtccacc tctacaggga cgggaaggac cagccattta tgcagatgcc acagtatcaa 300 

ggcaggacaa aactggtgaa ggattctatt gcggaggggc gcatctctct gaggctggaa 360 

aacattactg tgttggatgc tggcctctat gggtgcagga ttagttccca gtcttactac 420 

cagaaggcca tctgggagct acaggtgtca gcactgggct cagttcctct catttccatc 480 

gcgggatatg ttgatagaga catccagcta ctctgtcagt cctcgggctg gttcccccgg 540 

cccacagcga agtggaaagg tccacaagga caggatttgt ccacagactc caggacaaac 600 

agagacatgc atggcctgtt tgatgtggag atctctctga ccgtccaaga gaacgccggg 660 

agcatatcct gttccatgcg gcatgctcat ctgagccgag aggtggaatc cagggtacag 720 

ataggagact ggagaagaaa gcacggacag gcaggtaaaa gaaaatattc ctcttcacac 780 

atttatgact cctttccaag tctctcgttt atggattttt atatcctgag gcccgtgggt 840 

ccctgcagag ccaagcttgt gatgggaact ctgaaattgc agattctggg ggaggtgcat 900 

tttgtagaga agccccatag ccttcttcag atctctggag ggtccacaac actcaaaaag 960 

ggtcccaatc cttggtcttt cccttctccc tgcgccctgt ttcccacgtg agcacggaac 1020 
tgcctgctct ctctgcttgc tttcagaatt gagagacgcc cggaaacacg caggtaccaa * 1080 
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cgcctgagag ggtaacagtg ggcatggagt aggaagatga ccagtgacag atatggagcc 1140 

catccagctt gtagacagca aatctgtgat gcccgaatcc accccagggt gcagctgcct 1200 

ctaaatacac ttcttggccc aggacttgga gggaaaagcg tagggactgg gtcagctagg 1260 

aggggtcaca ggcaagacgc cagggaactg agggcattag tagctggctt ctaggggtct 1320 

gtgcaaaggg gaacgaagtg aagttagcag gaactggtgg gtggaaggaa gctgaatcct 1380 

ggagtcactc aaggtctcac aaagtcaaat agagggctta cgtgggaggg cagtggtagg 1440 

gctgggtgaa catctcatgg ttgagcatct ccaagcatca gtgaggcacg ggggctgccc 1500 

tggagaaggt acatggctgg tgggatagtg ggactggccg gatcctaccc ggagccagtc 1560 

tgcagtggga gggtcgacct cttgctccag cccagatttc gtcttcagta actcatgctt 1620 

cctctctccc ccaccgcacc ccagtggagg tgactctgga tccagagacg gctcacccga 1680 

agctctgcgt ttctgatctg aaaactgtaa cccatagaaa agctcctcag gaggtgcctc 1740 

actctgagaa gagatttaca aggaagagtg tggtggcttc tcagggtttc caagcaggga 1800 

aacattactg ggaggtggac gtgggacaaa atgtagggtg gtatgtggga gtgtgtcggg 1860 

atgacgtaga cagggggaag aacaatgtga ctttgtctcc caacaatggg tattgggtcc 1920 

tcagactgac aacagaacat ttgtatttca cattcaatcc ccattttatc agcctccccc 1980 

ccagcacccc tcctacacga gtaggggtct tcctggacta tgagggtggg accatctcct 2040 

tcttcaatac aaatgaccag tcccttattt ataccctgct gacatgtcag tttgaaggct 2100 

tgttgagacc ctatatccag catgcgatgt atgacgagga aaaggggact cccatattca 2160 

tatgtccagt gtcctgggga tgagacagag aagaccctgc ttaaagggcc ccacaccaca 2220 

gacccagaca cagccaaggg agagtgctcc cgacaggtgg ccccagcttc ctctccggag 2280 

cctgcgcaca gagagtcacg ccccccactc tcctttaggg agctgaggtt cttctgccct 2340 

gagccctgca gcagcggcag tcacagcttc cagacgaggg gggattggcc tgaccctgtg 2400 

ggagtcagaa gccatggctg ccctgaagtg gggacggaat agactcacat taggtttagt 2460 

ttgtgaaaac tccatccagc taagcgatct tgaacaagtc acaacctccc aggctcctca 2520 

tttgctagtc acggacagtg attcctgcct cacaggtgaa gattaaagag acaacgaatg 2580 

tgaatcatgc ttgcaggttt gagggccaca gtgtttgcta atggatgtgt ttttatgatt 2640 

atacattttc cccaccataa aactctgttt gccttaattc ccacattaat ttaacttttc 2700 

ctcctatacc caaatccacc catggaatag ttaattggaa cacctgcctt tgtgaggctc 2760 

caaagaataa agaggaggta ggatttttca ctgattctat aagcccagca ttacctgata 2820 

ccaaaaccag gcaaagaaaa cagaagaaga ggaaggaaaa ctacaggtcc atatccctca 2880 

ttaacacaga cacaaaaatt ctaaataaaa ttttaacaaa ttaaactaaa caatatattt 2940 

aaagatgata tataactact cagtgtggtt tgtcccacaa atgcagagtt ggtttaatat 3000 

ttaaatatca accagtgtaa ttcagcacat taataaagta aaaaaaaaaa aaaaaaaaa 3059 



<210> 20 

<211> 1699 

<212> DNA 

<213> Homo sapiens 

<400> 20 

ggcacgaggg ggcaggcatg ggagccgcgc gctctctccc ggcgcccaca cctgtctgag 60 

cggcgcagcg agccgcggcc cgggcgggct gctcggcgcg gaacagtgct cggcatggca 120 

gggattccag ggctcctctt ccttctcttc tttctgctct gtgctgttgg gcaagtgagc 180 

ccttacagtg ccccctggaa acccacttgg cctgcatacc gcctccctgt cgtcttgccc 240 

cagtctaccc tcaatttagc caagccagac tttggagccg aagccaaatt agaagtatct 300 

tcttcatgtg gaccccagtg tcataaggga actccactgc ccacttacga agaggccaag 360 

caatatctgt cttatgaaac gctctatgcc aatggcagcc gcacagagac gcaggtgggc 420 

atctacatcc tcagcagtag tggagatggg gcccaacacc gagactcagg gtcttcagga 480 

aagtctcgaa ggaagcggca gatttatggc tatgacagca ggttcagcat ttttgggaag 540 

gacttcctgc tcaactaccc tttctcaaca tcagtgaagt tatccacggg ctgcaccggc 600 

accctggtgg cagagaagca tgtcctcaca gctgcccact gcatacacga tggaaaaacc 660 

tatgtgaaag gaacccagaa gcttcgagtg ggcttcctaa agcccaagtt taaagatggt 720 

ggtcgagggg ccaacgactc cacttcagcc atgcccgagc agatgaaatt tcagtggatc 780 

cgggtgaaac gcacccatgt gcccaagggt tggatcaagg gcaatgccaa tgacatcggc 840 

atggattatg attatgccct cctggaactc aaaaagcccc acaagagaaa atttatgaag 900 

attggggtga gccctcctgc taagcagctg ccagggggca gaattcactt ctctggttat 960 

gacaatgacc gaccaggcaa tttggtgtat cgcttctgtg acgtcaaaga cgagacctat 1020 
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gacttgctct accaacaatg cgattcccag ccaggggcca gcgggtctgg ggtctatgtg 1080 

aggatgtgga agagacaaca ccagaagtgg gagcggaaaa ttattggcat gatttcaggg 1140 

caccagnggg tggacatgga tggttcccca caggaattca cacgtggctg ttcagagatc 1200 

actcctctcc aatatatccc agatatatct attggagtat aaggaaacta cctggattgt 1260 

agggaggggt gacacagtgt ccctcctgca gcaactaagg tcgtcatgtt cttattttag 1320 

gagaggccaa attgtttttt gtcattgccg tgcacacgtg tgtgtgtgtg tgtgtgtgtg 1380 

tgtaaggtgt cttataatct tttacctatt tcttacaatt gcaagatgac tggctttact 1440 

atttgaaaac tggtttgtgt atcatatcat atatcattta agcagtttga aggcatactt 1500 

ttgcatagaa ataaaaaaaa tactgatttg gggcaatgag gaatatttga caattaagtt 1560 

aatcttcacg tttttgcaaa ctttgatttt tatttcatct gaacttgttt caaagattta 1620 

tattaaatat ttggcataca aaaaaaaaaa aaaaaaacgg gggggcccgt acccaattcg 1680 

ccctatagtg aggcgatac 1699 



'<210> 21 
<211> 1520 
<212> DNA 
<213> Homo sapiens 

<400> 21 

gaattcggca gagcaggtgc ccgacatggc gagtgtagtg ctgccgagcg gatcccagtg 60 

tgcggcggca gcggcggcgg cggcgcctcc cgggctccgg ctccggcttc tgctgttgct 120 

cttctccgcc gcggcactga tccccacagg tgatgggcag aatctgttta cgaaagacgt 180 

gacagtgatc gagggagagg ttgcgaccat cagttgccaa gtcaataaga gtgacgactc 240 

tgtgattcag ctactgaatc ccaacaggca gaccatttat ttcagggact tcaggccttt 300 

gaaggacagc aggtttcagt tgctgaattt ttctagcagt gaactcaaag tatcattgac 360 

aaacgtctca atttctgatg aaggaagata cttttgccag ctctataccg atcccccaca 420 

ggaaagttac accaccatca cagtcctggt cccaccacgt aatctgatga tcgatatcca 480 

gaaagacact gcggtggaag gtgaggagat tgaagtcaac tgcactgcta tggccagcaa 540 

gccagccacg actatcaggt ggttcaaagg gaacacagag ctaaaaggca aatcggaggt 600 

ggaagagtgg tcagacatgt acactgtgac cagtcagctg atgctgaagg tgcacaagga 660 

ggacgatggg gtcccagtga tctgccaggt ggagcaccct gcggtcactg gaaacctgca 720 

gacccagcgg tatctagaag tacagtataa gcctcaagtg cacattcaga tgacttatcc 780 

tctacaaggc ttaacccggg aaggggacgc gcttgagtta acatgtgaag ccatcgggaa 840 

gccccagcct gtgatggtaa cttgggtgag agtcgatgat gaaatgcctc aacacgccgt 900 

actgtctggg cccaacctgt tcatcaataa cctaaacaaa acagataatg gtacataccg 960 

ctgtgaagct tcaaacatag tggggaaagc tcactcggat tatatgctgt atgtatacga 1020 

^ tccccccaca actatccctc ctcccacaac aaccaccacc accaccacca ccaccaccac 1080 

caccatcctt accatcatca cagattcccg agccaggtga agaaggctcg atcagggcag 1140 

tggatcatgc cgtgatcggt ggcgtcgtgg cggtggtggt gttcgccatg ctgtgcttgc 1200 

^.tcatcattct ggggcgctat tttgccagac ataaaggtac atacttcact catgaagcca 1260 

aaggagccga tgacgcagca gacgcagaca cagctataat caatgcagaa ggaggacaga 1320 

acaactccga agaaaagaaa gagtacttca tctagatcag cctttttgtt tcaatgaggt 1380 

gtccaactgg ccctatttag atgataaaga gacagtgata ttggaacttg cgagaaattc 1440 

gtgtgttttt ttatgaatgg gtggaaaggt gtgagactgg gaaggcttgg gatttgctgt 1500 

gtaaaaaaaa aaaaaaaaaa 1520 



<210> 22 

<211> 807 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (803) 

<223> n equals a,t,g, or c 
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<400> 22 



tttttttttt 


tgagacggag 


tctcgctctt 


tcacccaggc 


cagactgaag 


tggcgcagtc 


60 


tcggatcact 


gaaaagctca 


gcctcacggg 


atcacgacca 


ttctcctgcc 


tcagcctccc 


120 


gagtggctgg 


gactaaaggc 


gcccgccacc 


gacgcccgga 


ctaatttttt 


gtatttatag 


180 


tagagacggg 


gtttcaccgt 


gttagccaag 


atggtctcga 


tctcttgaac 


tcgtgatccg 


240 


cccgcgtcag 


cctcccaaag 


tgctgggatt 


acaggcgtga 


gccacagcgc 


ccggcctgta 


300 


aagatttctt 


gagcaaatga 


atgagtaaat 


gaaaggagtg 


ctcaaatttc 


ttttattcta 


360 


aaaaatgttc 


ccctttttta 


gaaaatgctc 


tgtagctttt 


gtaggtcttt 


cctgcactca 


420 


aacatccact 


ccttaccctt 


tccaatctcc 


cttttcttct 


caacccatag 


aatgtacttc 


480 


catgtctacc 


attccaccag 


aactactcta 


accaaggcta 


cccaggatat 


ccttatttct 


540 


ttcaacttct 


ttgacatgct 


cagtcgcatt 


tggcactgtt 


aatgtcttcc 


tcctctttga 


600 


aacacctcct 


ttgcatggca 


ctatcctggt 


tttcttcctt 


catttcagga 


gaaacttcat 


660 


tctccttact 


gaattctttc 


ttcctcccct 


atccatcatc 


tagatgttgt 


tgtttctcag 


720 


tgcagtgttc 


aatcctagac 


cccttttcat 


gtaactcaat 


gcgttttcct 


tgggagaatt 


780 


aattcccttc 


cctggtgtca 


ctntgcc 








807 



<210> 23 

<211> 2522 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gccgctcgac accagcaccc cgcccagagc agtgccgctg cccaaatcct cgcaggcagc 60 

tcatcaacgc aattgcaact ccggctggag ccccggacct gcaagcctgg gtgtccgtgg 120 

gtccgtctgc ccagccatct gctggtggca cctctccctc ctgccgcctc cctcggtgaa 180 

ccccaccttg cagaagtgca gctcgcccgg agcagcccag gagctcagca tgcgtccccc 240 

aggcttcagg aacttcttgc tgctggcgtc ctcccttctc tttgctgggt tgtcagctgt 300 

tcctcaaagc ttctcgccat ctctgaggag ctggccgggc gccgcctgca ggctgtcccg 360 

ggccgagtcg gagcgacgct gccgcgcacc tgggcagccc ccgggggccg cgctgtgcca 420 

cggccggggc cgctgcgact gcggcgtctg catctgccac gtgactgagc cgggcatgtt 480 

cttcgggccc ctgtgtgagt gccatgagtg ggtgtgcgag acctacgacg ggagcacctg 540 

tgcaggccat ggtaagtgtg actgtggcaa gtgcaagtgt gaccagggat ggtatgggga 600 

tgcttgccag tacccaacta actgtgactt gacaaagaag aaaagtaacc aaatgtgcaa 660 

gaattcacaa gacatcatct gctctaatgc aggtacatgt cactgtggca ggtgtaagtg 720 

tgataattca gatggaagtg gacttgtgta tggtaaattt tgtgagtgtg acgatagaga 780 

atgcatagac gatgaaacag aagaaatatg tggaggccat gggaagtgtt actgtggaaa 840 

ctgctactgc aaggctggtt ggcatggaga taaatgtgaa ttccagtgcg atatcacccc 900 

ctgggaaagc aagcgaagat gcacgtctcc agatggcaaa atctgcagta gcagagggac 960 

ttgtgtatgt ggtgaatgta cctgtcacga tgttgatccg actggggact ggggagatat 1020 

tcatggggac acctgtgaat gtgatgagag ggactgtaga gctgtctatg accgatattc 1080 

tgatgacttc tgttcaggtc atggacagtg taattgcgga agatgtgact gcaaagcagg 1140 

ctggtatggg aagaagtgtg agcacccaca gtcctgcacg ctgtcagctg aggagagcat 1200 

caggaagtgc cagggaagct cggatctgcc ttgctctggg aggggtaaat gtgaatgtgg 1260 

caaatgcacc tgctatcctc caggagatcg ccgggtgtat ggcaagactt gtgagtgtga 1320 

tgatcgccgc tgtgaagacc tcgatggtgt ggtctgtgga ggccacggca catgttcctg 1380 

tggtcgctgt gtttgtgaga gaggatggtt tggaaagctc tgccaacatc cgcggaagtg 1440 

taacatgacg gaagaacaaa gcaagaatct gtgtgaatca gcagatggca tattgtgctc 1500 

ggggaagggt tcttgtcatt gtgggaagtg catttgttct gctgaagagt ggtatatttc 1560 

tggggagttc tgtgactgtg atgacagaga ctgcgacaaa catgatggtc tcatttgtac 1620 

cagggaatgg aatatgtagc tgtggaaact gtgaatgctg ggatggatgg aatggaaatg 1680 

catgtgaaat ctggcttggc tcagaatatc cttaacaatt acatgagaga ggtctggatt 1740 

cttatttttt ctgggccatt agaacatata aatgcgaagg aaaccatgta tattcaccac 1800 

taggacaggt taaaaagacc attgtatgtt tttctatttc tgaattacga atgaaatccg 1860 

agtacctatt agaaatgagt tatgcaaatt tagatgcaaa taacattaga aaaaaaagat 1920 

tcttccataa ttaacataag tggttcctaa cgagagcaat ttttccaccc aaaagtcatt 1980 

tggcaacatc tacagacaat tttgattgtc acactgggtc gggtaggaag gtatgctgca 2040 

gacatttggt gggtagaggc cagggatgct gctgagcatc ccgcagtgta caggacagcc 2100 
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cccaaacaag gaattatcca gccccaaatg ccaatagggc tcagactgag aaacattgag 2160 

ttatatggct attagaaatc cacattctta cacaagaaag accatattag aatctaagga 2220 

aaacatgcat attcacatta attaatcgat cagatttttc cagaattccg tatcagtcac 2280 

cattttaata tggggacaat gaagacaagc acacaggagg tagaatatca gagtggggct 2340 

ggatcaaggg caaaaactgg tcattaagtc atctgacatt aaatcattta gccactaagt 2400 

tatttgtcta ctctcacttt aaactcacca aagaagattc tcttaaagaa attatgaaaa 2460 

atgtacaatt taacatttta aataaatagt gacagaagtt gtttataaaa aaaaaaaaaa 2520 

aa 2522 



<210> 24 

<211> 1344 

<212> DNA 

<213> Homo sapiens 

t<400> 24 

.gtccgagaga acaggcctgt ctcaggcagg ccctgcgcct cctatgcgga gatgctactg 60 

_ccactgctgc tgtcctcgct gctgggcggg tcccaggcta tggatgggag attctggata 120 
cgagtgcagg agtcagtgat ggtgccggag ggcctgtgca tctctgtgcc ctgctctttc 
tcctaccccc gacaagactg gacagggtct accccagctt atggctactg gttcaaagca 
gtgactgaga caaccaaggg tgctcctgtg gccacaaacc accagagtcg agaggtggaa 

atgagcaccc ggggccgatt tccaggctca ctgggggatc ccgccaaggg gaactgctcc 360 

ttggtgatca gaagacgcgc agatgcaagg atgagtcaca gtacttcttt cgggtggaga 420 

gaggaagcta tgtgagatat aatttcatga acgatgggtt ctttctaaaa gtaacagtgc 480 

tcagcttcac gcccagaccc caggaccaca acaccgacct cacctgccat gtggacttct 540 

ccagaaaggg tgtgagcgca cagaggaccg tccgactccg tgtggcctat gcccccagag 600 

accttgttat cagcatttca cgtgacaaca cgccagccct ggagccccag ccccagggga 660 

aatgtcccat acctggaagc ccaaaaaggc cagttcctgc ggctcctctg tgctgctgac 720 

agccagcccc ctgccacact gagctgggtc ctgcagaaca gagtcctctc ctcgtcccat 780 

ccctggggcc ctagacccct ggggctggag ctgcccgggg tgaaggctgg ggattcaggg 840 

cgctacacct gccgagcgga gaacaggctt ggctcccagc agcgagccct ggacctctct 900 

gtgcagtatc ctccagagaa cctgagagtg atggtttccc aagcaaacag gacagtcctg 960 

gaaaaccttg ggaacggcac gtctctccca gtactggagg gccaaagcct gtgcctggtc 1020 

tgtgtcacac acagcagccc cccagccagg ctgagctgga cccagagggg acaggttctg 1080 

agcccctccc agccctcaga ccccggggtc ctggagctgc ctcgggttca agtggagcac 1140 

gaaggagagt tcacctgcca cgctcggcac ccactgggct cccagcacgt ctctctcagc 1200 

ctctccgtgc actactcccc gaagctgctg ggcccctcct gctcctggga agctgaaggt 1260 

^ctgcactgca actgctcctc ccaggccagc ccggccccct ctctgcgctg gtggcttggg 1320 

~gaggagctgc tggaagggga acaa 1344 



180 
240 
300 



"<210> 25 

<211> 4631 

<212> DNA 

<213> Homo sapiens 



<400> 25 

ccgccgcgcc gaggaggctg ccgctctggc ttgccgcccc ccgccgccgc tgcacaccgg 60 

acccagccgc cgtgccgcgg gccatggacc tgcccagggg cctggtggtg gcctgggcgc 120 

tcagcctgtg gccagggttc acggacacct tcaacatgga caccaggaag ccccgggtca 180 

tccctggctc caggaccgcc ttctttggct acacagtgca gcagcacgac atcagtggca 240 

ataagtggct ggtcgtgggc gccccactgg aaaccaatgg ctaccagaag acgggagacg 300 

tgtacaagtg tccagtgatc cacgggaact gcaccaaact caacctggga agggtcaccc 360 

tgtccaacgt gtccgagcgg aaagacaaca tgcgcctcgg ccttagtctc gccaccaacc 420 

ccaaggacaa cagcttcctg gcctgcagcc ccctctggtc tcatgagtgt gggagctcct 480 

actacaccac agggatgtgt tcaagagtca actccaactt caggttctcc aagaccgtgg 540 

ccccagctct ccaaaggtgc cagacctaca tggacatcgt cattgtcctg gatggctcca 600 

acagcatcta cccctgggtg gaggttcagc acttcctcat caacatcctg aaaaagtttt 660 
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acattggccc agggcagatc caggttggag ttgtgcagta tggcgaagat gtggtgcatg 720 

agtttcacct caatgactac aggtctgtaa aagatgtggt ggaagctgcc agccacattg 780 

agcagagagg aggaacagag acccggacgg catttggcat tgaatttgca cgctcagagg 840 

ctttccagaa gggtggaagg aaaggagcca agaaggtgat gattgtcatc acagatgggg 900 

agtcccacga cagcccagac ctggagaagg tgatccagca aagcgaaaga gacaacgtaa 960 

caagatatgc ggtggccgtc ctgggctact acaaccgcag ggggatcaat ccagaaactt 1020 

ttctaaatga aatcaaatac atcgccagtg accctgatga caagcacttc ttcaatgtca 1080 

ctgatgaggc tgccttgaag gacattgtcg atgccctggg ggacagaatc ttcagcctgg 1140 

aaggcaccaa caagaacgag acctcctttg ggctggagat gtcacagacg ggcttttcct 1200 

cgcacgtggt ggaggatggg gttctgctgg gagccgtcgg tgcctatgac tggaatggag 1260 

ctgtgctaaa ggagacgagt gccgggaagg tcattcctct ccgcgagtcc tacctgaaag 1320 

agttccccga ggagctcaag aaccatggtg catacctggg gtacacagtc acatcggtcg 1380 

tgtcctccag gcaggggcga gtgtacgtgg ccggagcccc ccggttcaac cacacgggca 1440 

aggtcatcct gttcaccatg cacaacaacc ggagcctcac catccaccag gctatgcggg 1500 

gccagcagat aggctcttac tttgggagtg aaatcacctc ggtggacatc gacggcgacg 1560 

gcgtgactga tgtcctgctg gtgggcgcac ccatgtactt caacgagggc cgtgagcgag 1620 

gcaaggtgta cgtctatgag ctgagacaga accggtttgt ttataacgga acgctaaagg 1680 

attcacacag ttaccagaat gcccgatttg ggtcctccat tgcctcagtt cgagacctca 1740 

accaggattc ctacaatgac gtggtggtgg gagcccccct ggaggacaac cacgcaggag 1800 

ccatctacat cttccacggc ttccgaggca gcatcctgaa gacacctaag cagagaatca 1860 

cagcctcaga gctggctacc ggcctccagt attttggctg cagcatccac gggcaattgg 1920 

acctcaatga ggatgggctc atcgacctgg cagtgggagc ccttggcaac gctgtgattc 1980 

tgtggtcccg cccagtggtt cagatcaatg ccagcctcca ctttgagcca tccaagatca 2040 

acatcttcca cagagactgc aagcgcagtg gcagggatgc cacctgcctg gccgccttcc 2100 

tctgcttcac gcccatcttc ctggcacccc atttccaaac aacaactgtt ggcatcagat 2160 

acaacgccac catggatgag aagcggtata caccgagggc ccacctggac gaaggcgggg 2220 

accgattcac caacagagcc gtactgctct cctccggcca ggagctctgt gagcggatca 2280 

acttccatgt cctggacact gctgactacg tgaagccagt gaccttctca gtcgagtatt 2340 

ccctggagga ccctgaccat ggccccatgc tggacgacgg ctggcccacc actctcagag 2400 

tctcggtgcc cttctggaac ggctgcaatg aggatgagca ctgtgtccct gaccttgtgt 2460 

tggatgcccg gagtgacctg cccacggcca tggagtactg ccagagggtg ctgaggaagc 2520 

ctgcgcagga ctgctccgca tacacgctgt ccttcgacac cacagtcttc atcatagaga 2580 

gcacacgcca gcgagtggcg gtggaggcca cactggagaa caggggcgag aacgcctaca 2640 

gcacggtcct aaatatctcg cagtcagcaa acctgcagtt tgccagcttg atccagaagg 2700 

aggactcaga cggtagcatt gagtgtgtga acgaggagag gaggctccag aagcaagtct 2760 

gcaacgtcag ctatcccttc ttccgggcca aggccaaggt ggctttccgt cttgattttg 2820 

agttcagcaa atccatcttc ctacaccacc tggagatcga gctcgctgca ggcagtgaca 2880 

gtaatgagcg ggacagcacc aaggaagaca acgtggcccc cttacgcttc cacctcaaat 2940 

acgaagctga cgtcctcttc accaggagca gcagcctgag ccactacgag gtcaagctca 3000 

acagctcgct ggagagatac gatggtatcg ggcctccctt cagctgcatc ttcaggatcc 3060 

agaacttggg cttgttcccc atccacggga ttatgatgaa gatcaccatt cccatcgcca 3120 

ccaggagcgg caaccgccta ctgaagctga gggacttcct cacggacgag ggcgaacacg 3180 

tcctgtaaca tctggggcaa tagcactgag taccggccca ccccagtgga ggaagacttg 3240 

cgtcgtgctc cacagctgaa tcacagcaac tctgatgtcg tctccatcaa ctgcaatata 3300 

cggctggtcc ccaaccagga aatcaatttc catctactgg ggaacctgtg gttgaggtcc 3360 

gtaaaagcac tcaagtacaa atccatgaaa atcatggtca acgcagcgtt gcagaggcag 3420 

ttccacagcc ccttcatctt ccgtgaggag gatcccagcc gccagatcgt gtttgagatc 3480 

tccaagcaag aggactggca ggtccccatc tggatcattg taggcagcac cctggggggc 3540 

ctcctactgc tggccctgct ggtcctggca ctgtggaagc tcggcttctt tagaagtgcc 3600 

aggcgcagga gggagcctgg tctggacccc acccccaaag tgctggagtg aggctccaga 3660 

ggagactttg agttgatggg ggccagacac cagtccaggt agtgttgaga cccaggcctg 3720 

tggccccacc gagctggagc ggagaggaag ccagctggct ttgcacttga cctcatctcc 3780 

cgagcaatgg cgcctgctcc ctccagaatg gaactcaagc tggttttaag tggaactgcc 3840 

ctactgggag actgggacac ctttaacaca gacccctagg gatttaaagg gacaccccta 3900 

cacacaccca ggcccacgcc aaggcctccc tcaggctctg tggagggcat ttgctgcccc 3960 

agctactaag gtgctaggaa ttcgtaatca tccccatcct ccagagaaac ccagggagga 4020 

agactgtaaa tacgaaccca atctgcacac tccaggcctc tagttccaga aggatccaag 4080 

acaaaacaga tctgaattct gcccttttct ctcacccatc ccacccctcc attggctccc 4140 
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aagtcacacc cactcccttc cccatagata ggcccctggg gctcccgaag atgaacccaa 4200 

gagcaagggc ttgatggtga cagctgcaag ccagggatga agaaagactc tgagatgtgg 4260 

agactgatgg ccaggcaagt gggaccagga tactggacgc tgtcctgaga tgagaggtag 4320 

ccgggctctg cacccacgtg cattcacatt gaccgcaact cacacattcc cccaccagct 4380 

gcagcccctt gctctcagct gccaaccctc ccgggtcact tttgttccca ggtacctcat 4440 

gggaagcatg tggatgacac aatccctggg gctgtgcatt cccacgtctt cttgctgcag 4500 

cctgccccta gacatggacg caccggcctg gctgcagctg ggcagcaggg gtaggggtag 4560 

ggagcctccc ctccctgtat caccccctcc ctacacgtcg acgcggccgc gaattcccgg 4620 

gtcgacgagc t 4631 



<210> 26 
<211> 2008 
<212> DNA 
"jc213> Homo sapiens 

<400> 26 

^ cgggggcttt ctaacgggaa aaactctact aaagggttca aaagctggag ctccaccgcg 60 

gtggcggccg ctctagaact agtggatccc ccgggctgca ggaattcggc acgagctcgt 120 

gccgaattcg gcacgagtca cagaacacat ccatggctct matgctcagt ttggttctga 180 

gtctcctcaa gctgggwtca gggcagtggc aggtgtttgg gccagacaag cctgtccagg 240 

ccttggtggg ggaggacgca gcattctcct gtttcctgtc tcctaagacc aatgcagagg 300 

ccatggaagt gcggttcttc aggggccagt tctctagcgt ggtccacctc tacagggacg 360 

ggaaggacca gccatttatg cagatgccac agtatcaagg caggacaaaa ctggtgaagg 420 

attctattgc ggaggggcgc atctctctga ggctggaaaa cattactgtg ttggatgctg 480 

gcctctatgg gtgcaggatt agttcccagt cttactacca gaaggccatc tgggagctac 540 

aggtgtcagc actgggctca gttcctctca tttccatcac gggatatgtt gatagagaca 600 

tccagctact ctgtcagtcc tcgggctggt tcccccggcc cacagcgaag tggaaaggtc 660 

cacaaggaca ggatttgtcc acagactcca ggacaaacag agacatgcat ggcctgtttg 720 

atgtggagat ctctctgacc gtccaagaga acgccgggag catatcctgt tccatgcggc 780 

atgctcatct gagccgagag gtggaatcca gggtacagat aggagatacc tttttcgagc 840 

ctatatcgtg gmacctggyt accaaagtac tgggaatact ctgctgtggc ctattttttg 900 

gcattgttgg actgaagatt ttcttctcca aattccagtg gaaaatccag gcggaactgg 960 

actggagaag aaagcacgga caggcagaat tgagagacgc ccggaaacac gcagtggagg 1020 

tgactctgga tccagagacg gctcacccga agctctgcgt ttctgatctg aaaactgtaa 1080 

cccatagaaa agctccccag gaggtgcctc actctgagaa gagatttaca aggaagagtg 1140 

tggtggcttc tcagagtttc caagcaggga aacattactg ggaggtggac ggaggacaca 1200 

ataaaaggtg gcgcgtggga gtgtgccggg atgatgtgga caggaggaag gagtacgtga 1260 

. ctttgtctcc cgatcatggg tactgggtcc tcagactgaa tggagaacat ttgtatttca 1320 

cattaaatcc ccgttttatc agcgtcttcc ccaggacccc acctacaaaa ataggggtct 1380 

^ tcctggacta tgagtgtggg accatctcct tcttcaacat aaatgaccag tcccttattt 1440 

ataccctgac atgtcggttt gaaggcttat tgaggcccta cattgagtat ccgtcctata 1500 

atgagcaaaa tggaactccc agagacaagc aacagtgagt cctcctcaca ggcaaccacg 1560 

cccttcctcc ccaggggtga aatgtaggat gaatcacatc ccacattctt ctttagggat 1620 

attaaggtct ctctcccaga tccaaagtcc cgcagcagcc ggccaaggtg gcttccagat 1680 

gaagggggac tggcctgtcc acatgggagt caggtgtcat ggctgccctg agctgggagg 1740 

gaagaaggct gacattacat ttagtttgct ctcactccat ctggctaagt gatcttgaaa 1800 

taccacctct caggtgaaga accgtcagga attcccatct cacaggctgt ggtgtagatt 1860 

aagtagacaa ggaatgtgaa taatgcttag atcttattga tgacagagtg tatcctaatg 1920 

gtttgttcat tatattacac tttcagtaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaamc 1980 

tcgagggggg gcccggtacc caattcgg 2008 



<210> 27 

<211> 1654 

<212> DNA 

<213> Homo sapiens 
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<400> 27 

ggcacgaggg ggcaggcatg ggagccgcgc gctctctccc ggcgcccaca cctgtctgag 60 

cggcgcagcg agccgcggcc cgggcgggct gctcggcgcg gaacagtgct cggcatggca 120 

gggattccag ggctcctctt ccttctcttc tttctgctct gtgctgttgg gcaagtgagc 180 

ccttacagtg ccccctggaa acccacttgg cctgcatacc gcctccctgt cgtcttgccc 240 

cagtctaccc tcaatttagc caagccagac tttggagccg aagccaaatt agaagtatct 300 

tcttcatgtg gaccccagtg tcataaggga actccactgc ccacttacga agaggccaag 360 

caatatctgt cttatgaaac gctctatgcc aatggcagcc gcacagagac gcaggtgggc 420 

atctacatcc tcagcagtag tggagatggg gcccaacacc gagactcagg gtcttcagga 480 

aagtctcgaa ggaagcggca gatttatggc tatgacagca ggttcagcat ttttgggaag 540 

gacttcctgc tcaactaccc tttctcaaca tcagtgaagt tatccacggg ctgcaccggc 600 

accctggtgg cagagaagca tgtcctcaca gctgcccact gcatacacga tggaaaaacc 660 

tatgtgaaag gaacccagaa gcttcgagtg ggcttcctaa agcccaagtt taaagatggt 720 

ggtcgagggg ccaacgactc cacttcagcc atgcccgagc agatgaaatt tcagtggatc 780 

cgggtgaaac gcacccatgt gcccaagggt tggatcaagg gcaatgccaa tgacatcggc 840 

atggattatg attatgccct cctggaactc aaaaagcccc acaagagaaa atttatgaag 900 

attggggtga gccctcctgc taagcagctg ccagggggca gaattcactt ctctggttat 960 

gacaatgacc gaccaggcaa tttggtgtat cgcttctgtg acgtcaaaga cgagacctat 1020 

gacttgctct accagcaatg cgatgcccag ccaggggcca gcgggtctgg ggtctatgtg 1080 

aggatgtgga agagacagca gcagaagtgg gagcgaaaaa ttattggcat tttttcaggg 1140 

caccagtggg tggacatgaa tggttcccca caggatttca acgtggctgt cagaatcact 1200 

cctctcaaat atgcccagat ttgctattgg attaaaggaa actacctgga ttgtagggag 1260 

gggtgacaca gtgttccctc ctggcagcaa ttaagggtct tcatgttctt attttaggag 1320 

aggccaaatt gttttttgtc attggcgtgc acacgtgtgt gtgtgtgtgt gtgtgtgtgt 1380 

aaggtgtctt ataatctttt acctatttct tacaattgca agatgactgg ctttactatt 1440 

tgaaaactgg tttgtgtatc atatcatata tcatttaagc agtttgaagg catacttttg 1500 

catagaaata aaaaaaatac tgatttgggg caatgaggaa tatttgacaa ttaagttaat 1560 

cttcacgttt ttgcaaactt tgatttttat ttcatctgaa cttgtttcaa agatttatat 1620 

taaatatttg gcatacaaga aaaaaaaaaa aaaa 1654 



<210> 28 
<211> 1508 
<212> DNA 

<213> Homo sapiens 
<400> 28 

ggcacgagga catggcgtag tgtagtgctg ccgagcggat cccagtgtgc ggcggcagcg " 60 

gcggcggcgg cgcctcccgg gctccggctc cggcttctgc tgttgctctt ctccgccgcg 120 

gcactgatcc ccacaggtga tgggcagaat ctgtttacga aagacgtgac agtgatcgag 180 

ggagaggttg cgaccatcag ttgccaagtc aataagagtg acgactctgt gattcagcta 240 

ctgaatccca acaggcagac catttatttc agggacttca ggcctttgaa ggacagcagg 300 

tttcagttgc tgaatttttc tagcagtgaa ctcaaagtat cattgacaaa cgtctcaatt 360 

tctgatgaag gaagatactt ttgccagctc tataccgatc ccccacagga aagttacacc 420 

accatcacag tcctggtccc accacgtaat ctgatgatcg atatccagaa agacactgcg 480 

gtggaaggtg aggagattga agtcaactgc actgctatgg ccagcaagcc agccacgact 540 

atcaggtggt tcaaagggaa cacagagcta aaaggcaaat cggaggtgga agagtggtca 600 

gacatgtaca ctgtgaccag tcagctgatg ctgaaggtgc acaaggagga cgatggggtc 660 

ccagtgatct gccaggtgga gcaccctgcg gtcactggaa acctgcagac ccagcggtat 720 

ctagaagtac agtataagcc tcaagtgcac attcagatga cttatcctct acaaggctta 780 

acccgggaag gggacgcgct tgagttaaca tgtgaagcca tcgggaagcc ccagcctgtg 840 

atggtaactt gggtgagagt cgatgatgaa atgcctcaac acgccgtact gtctgggccc 900 

aacctgttca tcaataacct aaacaaaaca gataatggta cataccgctg tgaagcttca 960 

aacatagtgg ggaaagctca ctcggattat atgctgtatg tatacgatcc ccccacaact 1020 

atccctcctc ccacaacaac caccaccacc accaccacca ccaccaccac catccttacc 1080 

atcatcacag attccccgag ccaggtgaag aaggctcgat cagggcagtg gatcatgccg 1140 

tgatcggtgg cgtcgtggcg gtggtggtgt tcgccatgct gtgcttgctc atcattctgg 1200 

ggcgctattt tgccagacat aaaggtacat acttcactca tgaagccaaa ggagccgatg 1260 
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acgcagcaga cgcagacaca gctataatca atgcagaagg aggacagaac aactccgaag 1320 

aaaagaaaga gtacttcatc tagatcagcc tttttgtttc aatgaggtgt ccaactggcc 1380 

ctatttagat gataaagaga cagtgatatt ggaacttgcg agaaattcgt gtgttttttt 1440 

atgaatgggt ggaaaggtgt gagactggga aggcttggga tttgctgtgt aaaaaaaaaa 1500 

aaaaaaaa 1508 



<210> 29 
<211> 529 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Met Val Ala Gin Asp Pro Gin Gly Cys Leu Gin Leu Cys Leu Ser Glu 
15 10 15 

Val Ala Asn Gly Leu Arg Asn Pro Val Ser Met Val His Ala Gly Asp 
20 25 30 

Gly Thr His Arg Phe Phe Val Ala Glu Gin Val Gly Val Val Trp Val 
35 40 45 

Tyr Leu Pro Asp Gly Ser Arg Leu Glu Gin Pro Phe Leu Asp Leu Lys 
50 55 60 

Asn He Val Leu Thr Thr Pro Trp He Gly Asp Glu Arg Gly Phe Leu 
65 70 75 80 

Gly Leu Ala Phe His Pro Lys Phe Arg His Asn Arg Lys Phe Tyr He 
85 90 95 

Tyr Tyr Ser Cys Leu Asp Lys Lys Lys Val Glu Lys He Arg He Ser 
100 105 HO 

Glu Met Lys Val Ser Arg Ala Asp Pro Asn Lys Ala Asp Leu Lys Ser 
115 120 125 

Glu Arg Val He Leu Glu He Glu Glu Pro Ala Ser Asn His Asn Gly 
130 135 . 140 

Gly Gin Leu Leu Phe Gly Leu Asp Gly Tyr Met Tyr He Phe Thr Gly 
14 5 150 155 160 

Asp Gly Gly Gin Ala Gly Asp Pro Phe Gly Leu Phe Gly Asn Ala Gin 
165 170 175 

Asn Lys Ser Ser Leu Leu Gly Lys Val Leu Arg He Asp Val Asn Arg 
180 185 190 

Ala Gly Ser His Gly Lys Arg Tyr Arg Val Pro Ser Asp Asn Pro ,Phe 
195 200 205 

Val Ser Glu Pro Gly Ala His Pro Ala He Tyr Ala Tyr Gly He Arg 
210 215 220 

Asn Met Trp Arg Cys Ala Val Asp Arg Gly Asp Pro He Thr Arg Gin 
225 230 235 240 
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Gly Arg Gly Arg He Phe Cys Gly Asp Val Gly Gin Asn Arg Phe Glu 
245 250 255 

Glu Val Asp Leu He Leu Lys Gly Gly Asn Tyr Gly Trp Arg Ala Lys 
260 265 270 

Glu Gly Phe Ala Cys Tyr Asp Lys Lys Leu Cys His Asn Ala Ser Leu 
275 280 285 

Asp Asp Val Leu Pro He Tyr Ala Tyr Gly His Ala Val Gly Lys Ser 
290 295 300 

Val Thr Gly Gly Tyr Val Tyr Arg Gly Cys Glu Ser Pro Asn Leu Asn 
305 310 315 320 

Gly Leu Tyr He Phe Gly Asp Phe Met Ser Gly Arg Leu Met Ala Leu 
325 330 335 

Gin Glu Asp Arg Lys Asn Lys Lys Trp Lys Lys Gin Asp Leu Cys Leu 
340 345 350 

Gly Ser Thr Thr Ser Cys Ala Phe Pro Gly Leu He Ser Thr His Ser 
355 ' 360 365 

Lys Phe He He Ser Phe Ala Glu Asp Glu Ala Gly Glu Leu Tyr Phe 
370 375 380 

Leu Ala Thr Ser Tyr Pro Ser Ala Tyr Ala Pro Arg Gly Ser He Tyr 
385 390 395 400 

Lys Phe Val Asp Pro Ser Arg Arg Ala Pro Pro Gly Lys Cys Lys Tyr 
405 410 415 

Lys Pro Val Pro Val Arg Thr Lys Ser Lys Arg He Pro Phe Arg Pro 
420 425 430 

Leu Ala Lys Thr Val Leu Asp Leu Leu Lys Glu Gin Ser Glu Lys Ala 
435 440 445 

Ala Arg Lys Ser Ser Ser Ala Thr Leu Ala Ser Gly Pro Ala Gin Gly 
450 455 460 

Leu Ser Glu Lys Gly Ser Ser Lys Lys Leu Ala Ser Pro Thr Ser Ser 
465 470 475 480 

Lys Asn Thr Leu Arg Gly Pro Gly Thr Lys Lys Lys Ala Arg Val Gly 
485 490 495 

Pro His Val Arg Gin Gly Lys Arg Arg Lys Ser Leu Lys Ser His Ser 
500 505 5io 

Gly Arg Met Arg Pro Ser Ala Glu Gin Lys Arg Ala Gly Arg Ser Leu 
515 520 525 

Pro 
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<210> 30 

<211> 494 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Arg Pro Pro Gly Phe Arg Asn Phe Leu Leu Leu Ala Ser Ser Leu 
15 10 15 

Leu Phe Ala Gly Leu Ser Ala Val Pro Gin Ser Phe Ser Pro Ser Leu 
20 25 30 

Arg Ser Trp Pro Gly Ala Ala Cys Arg Leu Ser Arg Ala Glu Ser Glu 
35 40 45 

Arg Arg Cys Arg Ala Pro Gly Gin Pro Pro Gly Ala Ala Leu Cys His 
50 55 60 

Gly Arg Gly Arg Cys Asp Cys Gly Val Cys lie Cys His Val Thr Glu 
65 70 75 80 

Pro Gly Met Phe Phe Gly Pro Leu Cys Glu Cys His Glu Trp Val Cys 
85 90 95 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys 
100 105 110 

Gly Lys Cys Lys Cys Asp Gin Gly Trp Tyr Gly Asp Ala Cys Gin Tyr 
115 120 125 

Pro Thr Asn Cys Asp Leu Thr Lys Lys Lys Ser Asn Gin Met Cys Lys 
130 135 140 

Asn Ser Gin Asp lie lie Cys Ser Asn Ala Gly Thr Cys His Cys Gly 
145 150 155 160 

Arg Cys Lys Cys Asp Asn Ser Asp Gly Ser Gly Leu Val Tyr Gly Lys 
165 170 175 

Phe Cys Glu Cys Asp Asp Arg Glu Cys He Asp Asp Glu Thr Glu Glu 
180 185 190 

He Cys Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys 
195 200 205 

Ala Gly Trp His Gly Asp Lys Cys Glu Phe Gin Cys Asp He Thr Pro 
210 215 220 

Trp Glu Ser Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys He Cys Ser 
225 230 235 240 

Asn Arg Gly Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp 
245 250 255 

Pro Thr Gly Asp Trp Gly Asp He His Gly Asp Thr Cys Glu Cys Asp 
260 265 270 



Glu Arg Asp Cys Arg Ala Val Tyr Asp Arg Tyr Ser Asp Asp Phe Cys 
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275 280 285 

Ser Gly His Gly Gin Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly 
290 295 300 

Trp Tyr Gly Lys Lys Cys Glu His Pro Gin Ser Cys Thr Leu Ser Ala 
305 310 315 320 

Glu Glu Ser lie Arg Lys Cys Gin Gly Ser Ser Asp Leu Pro Cys Ser 
325 330 335 

Gly Arg Gly Lys Cys Glu Cys Gly Lys Cys Thr Cys Tyr Pro Pro Gly 
340 345 350 

Asp Arg Arg Val Tyr Gly Lys Thr Cys Glu Cys Asp Asp Arg Arg Cys 
355 360 365 

Glu Asp Leu Asp Gly Val Val Cys Gly Gly His Gly Thr Cys Ser Cys 
370 375 380 

Gly Arg Cys Val Cys Glu Arg Gly Trp Phe Gly Lys Leu Cys Gin His 
385 390 395 400 

Pro Arg Lys Cys Asn Met Thr Glu Glu Gin Ser Lys Asn Leu Cys Glu 
405 410 415 

Ser Ala Asp Gly lie Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly 
420 425 430 

Lys Cys He Cys Ser Ala Glu Glu Trp Tyr He Ser Gly Glu Phe Cys 
435 440 445 

Asp Cys Asp Asp Arg Asp Cys Asp Lys His Asp Gly Leu He Cys Thr 
450 455 460 

Gly Asn Gly He Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp 
465 470 475 480 

Asn Gly Asn Ala Cys Glu He Trp Leu Gly Ser Glu Tyr Pro 
485 490 



<210> 31 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<400> 31 

Met Glu Thr Gly Ala Leu Arg Arg 
1 5 

Leu Leu Cys Gly Gly Cys Pro Arg 
20 

Met Leu Glu Arg Leu Pro Leu Cys 
35 40 



Pro Gin Leu Leu Pro Leu Leu Leu 
10 15 

Ala Gly Gly Cys Asn Glu Thr Gly 
25 30 

Gly Lys Ala Phe Ala Asp Met Met 
45 



Gly Lys Val Asp Val Trp Lys Trp Cys Asn Leu Ser Glu Phe He Val 
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50 55 60 

Tyr Tyr Glu Ser Phe Thr Asn Cys Thr Glu Met Glu Ala Asn Val Val 
65 70 75 80 

Gly Cys Tyr Trp Pro Asn Pro Leu Ala Gin Gly Phe lie Thr Gly He 
85 90 95 

His Arg Gin Phe Phe Ser Asn Cys Thr Val Asp Arg Val His Leu Glu 
100 105 no 

Asp Pro Pro Asp Glu Val Leu He Pro Leu He Val He Pro Val Val 
H5 120 125 

Leu Thr Val Ala Met Ala Gly Leu Val Val Trp Arg Ser Lys Arg Thr 
130 135 140 

Asp Thr Leu Leu 
145 



<210> 32 

<211> 332 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Met Arg Leu Leu Ala Phe Leu Ser Leu Leu Ala Leu Val Leu Gin Glu 
1 5 io 15 

Thr Gly Thr Ala Ser Leu Pro Arg Lys Glu Arg Lys Arg Arg Glu Glu 
20 25 30 

Gin Met Pro Arg Glu Gly Asp Ser Phe Glu Val Leu Pro Leu Arg Asn 
35 40 45 

Asp Val Leu Asn Pro Asp Asn Tyr Gly Glu Val He Asp Leu Ser Asn 
50 55 60 

Tyr Glu Glu Leu Thr Asp Tyr Gly Asp Gin Leu Pro Glu Val Lys Val 
65 70 75 80 

Thr Ser Leu Ala Pro Ala Thr Ser He Ser Pro Ala Lys Ser Thr Thr 
85 90 95 

Ala Pro Gly Thr Pro Ser Ser Asn Pro Thr Met Thr Arg Pro Thr Thr 
100 105 HO 

Ala Gly Leu Leu Leu Ser Ser Gin Pro Asn His Gly Leu Pro Thr Cys 
115 120 125 

Leu Val Cys Val Cys Leu Gly Ser Ser Val Tyr Cys Asp Asp He Asp 
130 135 140 

Leu Glu Asp He Pro Pro Leu Pro Arg Arg Thr Ala Tyr Leu Tyr Ala 
145 150 155 160 

Arg Phe Asn Arg He Ser Arg He Arg Ala Glu Asp Phe Lys Gly Leu 
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165 170 175 

Thr Lys Leu Lys Arg He Asp Leu Ser Asn Asn Leu He Ser Ser He 
180 185 190 

Asp Asn Asp Ala Phe Arg Leu Leu His Ala Leu Gin Asp Leu He Leu 
195 200 205 

Pro Glu Asn Gin Leu Glu Ala Leu Pro Val Leu Pro Ser Gly He Glu 
210 215 220 

Phe Leu Asp Val Arg Leu Asn Arg Leu Gin Ser Ser Gly He Gin Pro 
225 230 235 240 

Ala Ala Phe Arg Ala Met Glu Lys Leu Gin Phe Leu Tyr Leu Ser Asp 
245 250 255 

Asn Leu Leu Asp Ser He Pro Gly Pro Leu Pro Pro Ser Leu Arg Ser 
260 265 270 

Val His Leu Gin Asn Asn Leu He Glu Thr Met Gin Arg Asp Val Phe 
275 280 285 

Cys Asp Pro Glu Glu His Lys His Thr Arg Arg Gin Leu Glu Asp He 
290 295 300 

Arg Leu Asp Gly Asn Pro He Asn Leu Ser Leu Phe Pro Ser Ala Tyr 
3 °5 310 315 320 

Phe Cys Leu Pro Arg Leu Pro He Gly Arg Phe Thr 
325 330 



<210> 33 

<211> 639 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Met Leu Leu Pro Leu Leu Leu Ser Ser Leu Leu Gly Gly Ser Gin Ala 
15 10 15 

Met Asp Gly Arg Phe Trp He Arg Val Gin Glu Ser Val Met Val Pro 
20 25 30 

Glu Ala Cys Asp He Ser Val Pro Cys Ser Phe Ser Tyr Pro Arg Gin 
35 40 45 

Asp Trp Thr Gly Ser Thr Pro Ala Tyr Gly Tyr Trp Phe Lys Ala Val 
50 55 60 

Thr Glu Thr Thr Lys Gly Ala Pro Val Ala Thr Asn His Gin Ser Arg 
65 70 75 80 

Glu Val Glu Met Ser Thr Arg Gly Arg Phe Gin Leu Thr Gly Asp Pro 
85 90 95 

Ala Lys Gly Asn Cys Ser Leu Val He Arg Asp Ala Gin Met Gin Asp 
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100 105 no 

Glu Ser Gin Tyr Phe Phe Arg Val Glu Arg Gly Ser Tyr Val Arg Tyr 
115 120 125 

Asn Phe Met Asn Asp Gly Phe Phe Leu Lys Val Thr Val Leu Ser Phe 
1^0 135 140 

Thr Pro Arg Pro Gin Asp His Asn Thr Asp Leu Thr Cys His Val Asp 
145 150 155 160 

Phe Ser Arg Lys Gly Val Ser Ala Gin Arg Thr Val Arg Leu Arg Val 
165 ivo 175 

Ala Tyr Ala Pro Arg Asp Leu Val lie Ser He Ser Arg Asp Asn Thr 
180 185 190 

~Pro Ala Leu Glu Pro Gin Pro Gin Gly Asn Val Pro Tyr Leu Glu Ala 
195 200 205 

Gin Lys Gly Gin Phe Leu Arg Leu Leu Cys Ala Ala Asp Ser Gin Pro 
210 215 220 

Pro Ala Thr Leu Ser Trp Val Leu Gin Asn Arg Val Leu Ser Ser Ser 
225 230 235 240 

His Pro Trp Gly Pro Arg Pro Leu Gly Leu Glu Leu Pro Gly Val Lys 
245 250 255 

Ala Gly Asp Ser Gly Arg Tyr Thr Cys Arg Ala Glu Asn Arg Leu Gly 
260 265 270 

Ser Gin Gin Arg Ala Leu Asp Leu Ser Val Gin Tyr Pro Pro Glu Asn 
275 280 285 

Leu Arg Val Met Val Ser Gin Ala Asn Arg Thr Val Leu Glu Asn Leu 
290 295 300 

Gly Asn Gly Thr Ser Leu Pro Val Leu Glu Gly Gin Ser Leu Cys Leu 
305 310 315 320 

Val Cys Val Thr His Ser Ser Pro Pro Ala Arg Leu Ser Trp Thr Gin 
325 330 335 

Arg Gly Gin Val Leu Ser Pro Ser Gin Pro Ser Asp Pro Gly Val Leu 
340 345 350 

Glu Leu Pro Arg Val Gin Val Glu His Glu Gly Glu Phe Thr Cys His 
355 360 365 

Ala Arg His Pro Leu Gly Ser Gin His Val Ser Leu Ser Leu Ser Val 
370 375 380 

His Tyr Ser Pro Lys Leu Leu Gly Pro Ser Cys Ser Trp Glu Ala Glu 
385 390 395 400 

Gly Leu His Cys Ser Cys Ser Ser Gin Ala Ser Pro Ala Pro Ser Leu 
405 410 415 
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Arg Trp Trp Leu Gly Glu Glu Leu Leu Glu Gly Asn Ser Ser Gin Asp 
420 425 430 

Ser Phe Glu Val Thr Pro Ser Ser Ala Gly Pro Trp Ala Asn Ser Ser 
435 440 445 

Leu Ser Leu His Gly Gly Leu Ser Ser Gly Leu Arg Leu Arg Cys Glu 
450 455 460 

Ala Trp Asn Val His Gly Ala Gin Ser Gly Ser lie Leu Gin Leu Pro 
465 470 475 480 

Asp Lys Lys Gly Leu lie Ser Thr Ala Phe Ser Asn Gly Ala Phe Leu 
485 490 > 495 

Gly lie Gly lie Thr Ala Leu Leu Phe Leu Cys Leu Ala Leu lie lie 
500 505 510 

Met Lys lie Leu Pro Lys Arg Arg Thr Gin Thr Glu Thr Pro Arg Pro 
515 520 525 

Arg Phe Ser Arg His Ser Thr He Leu Asp Tyr He Asn Val Val Pro 
530 535 540 

Thr Ala Gly Pro Leu Ala Gin Lys Arg Asn Gin Lys Ala Thr Pro Asn 
545 550 555 560 

Ser Pro Arg Thr Pro Leu Pro Pro Gly Ala Pro Ser Pro Glu Ser Lys 
565 570 575 

Lys Asn Gin Lys Lys Gin Tyr Gin Leu Pro Ser Phe Pro Glu Pro Lys 
580 585 590 

Ser Ser Thr Gin Ala Pro Glu Ser Gin Glu Ser Gin Glu Glu Leu His 
595. 600 605 

Tyr Ala Thr Leu Asn Phe Pro Gly Val Arg Pro Arg Pro Glu Ala Arg 
610 615 620 



Met Pro Lys Gly Thr Gin Ala Asp Tyr Ala Glu Val Lys Phe Gin 
625 630 635 



<210> 34 

<211> 467 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Met Leu Leu Leu Leu Leu Leu Pro Leu Leu Trp Gly Arg Glu Arg Val 
15 10 15 

Glu Gly Gin Lys Ser Asn Arg Lys Asp Tyr Ser Leu Thr Met Gin Ser 
20 25 30 

Ser Val Thr Val Gin Glu Gly Met Cys Val His Val Arg Cys Ser Phe 
35 40 45 
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Ser Tyr Pro Val Asp Ser Gin Thr Asp Ser Asp Pro Val His Gly Tyr 

50 55 60 

Trp Phe Arg Ala Gly Asn Asp He Ser Trp Lys Ala Pro Val Ala Thr 
65 70 75 80 

Asn Asn Pro Ala Trp Ala Val Gin Glu Glu Thr Arg Asp Arg Phe His 
85 90 95 

Leu Leu Gly Asp Pro Gin Thr Lys Asn Cys Thr Leu Ser He Arg Asp 
100 105 no 

Ala Arg Met Ser Asp Ala Gly Arg Tyr Phe Phe Arg Met Glu Lys Gly 
115 120 125 

Asn He Lys Trp Asn Tyr Lys Tyr Asp Gin Leu Ser Val Asn Val Thr 
130 135 140 

Ala Leu Thr His Arg Pro Asn He Leu He Pro Gly Thr Leu Glu Ser 
145 150 155 160 

Gly Cys Phe Gin Asn Leu Thr Cys Ser Val Pro Trp Ala Cys Glu Gin 
165 170 175 

Gly Thr Pro Pro Met He Ser Trp Met Gly Thr Ser Val Ser Pro Leu 
180 185 190 

His Pro Ser Thr Thr Arg Ser Ser Val Leu Thr Leu He Pro Gin Pro 
195 200 205 

Gin His His Gly Thr Ser Leu Thr Cys Gin Val Thr Leu Pro Gly Ala 
210 215 220 

Gly Val Thr Thr Asn Arg Thr He Gin Leu Asn Val Ser Tyr Pro Pro 
225 230 235 240 

Gin Asn Leu Thr Val Thr Val Phe Gin Gly Glu Gly Thr Ala Ser Thr 
2 45 250 255 

Ala Leu Gly Asn Ser Ser Ser Leu Ser Val Leu Glu Gly Gin Ser Leu 
260 265 270 

Arg Leu Val Cys Ala Val Asp Ser Asn Pro Pro Ala Arg Leu Ser Trp 
275 280 285 

Thr Trp Arg Ser Leu Thr Leu Tyr Pro Ser Gin Pro Ser Asn Pro Leu 
290 295 300 

Val Leu Glu Leu Gin Val His Leu Gly Asp Glu Gly Glu Phe Thr Cys 
305 310 315 320 

Arg Ala Gin Asn Ser Leu Gly Ser Gin His Val Ser Leu Asn Leu Ser 
325 330 335 

Leu Gin Gin Glu Tyr Thr Gly Lys Met Arg Pro Val Ser Gly Val Leu 
340 345 350 
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Leu Gly Ala Val Gly Gly Ala Gly Ala Thr Ala Leu Val Phe Leu Ser 
355 360 365 

Phe Cys Val He Phe He Val Val Arg Ser Cys Arg Lys Lys Ser Ala 
370 375 380 

Arg Pro Ala Ala Asp Val Gly Asp He Gly Met Lys Asp Ala Asn Thr 
385 390 395 400 

He Arg Gly Ser Ala Ser Gin Gly Asn Leu Thr Glu Ser Trp Ala Asp 
405 410 415 

Asp Asn Pro Arg His His Gly Leu Ala Ala His Ser Ser Gly Glu Glu 
420 425 430 

Arg Glu He Gin Tyr Ala Pro Leu Ser Phe His Lys Gly Glu Pro Gin 
435 440 445 

Asp Leu Ser Gly Gin Glu Ala Thr Asn Asn Glu Tyr Ser Glu He Lys 
450 455 460 

He Pro Lys 
465 



<210> 35 

<211> 1189 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Met Asp Leu Pro Arg Gly Leu Val Val Ala Trp Ala Leu Ser Leu Trp 
15 10 15 

Pro Gly Phe Thr Asp Thr Phe Asn Met Asp Thr Arg Lys Pro Arg Val 
20 25 30 

He Pro Gly Ser Arg Thr Ala Phe Phe Gly Tyr Thr Val Gin Gin His 
35 40 45 

Asp He Ser Gly Asn Lys Trp Leu Val Val Gly Ala Pro Leu Glu Thr 
50 55 60 

Asn Gly Tyr Gin Lys Thr Gly Asp Val Tyr Lys Cys Pro Val He His 
65 70 75 80 

Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn Val 
85 90 95 

Ser Glu Arg Lys Asp Asn Met Arg Leu Gly Leu Ser Leu Ala Thr Asn 
100 105 110 

Pro Lys Asp Asn Ser Phe Leu Ala Cys Ser Pro Leu Trp Ser His Glu 
115 120 125 



Cys Gly Ser Ser Tyr Tyr Thr Thr Gly Met Cys Ser Arg Val Asn Ser 
130 135 140 
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Asn Phe Arg Phe Ser Lys Thr Val Ala Pro Ala Leu Gin Arg Cys Gin 

145 150 155 160 

Thr Tyr Met Asp He Val He Val Leu Asp Gly Ser Asn Ser He Tyr 
165 170 175 

Pro Trp Val Glu Val Gin His Phe Leu He Asn lie Leu Lys Lys Phe 
180 185 190 

Tyr He Gly Pro Gly Gin He Gin Val Gly Val Val Gin Tyr Gly Glu 
l g 5 200 205 

Asp Val Val His Glu Phe His Leu Asn Asp Tyr Arg Ser Val Lys Asp 
210 215 220 

Val Val Glu Ala Ala Ser His He Glu Gin Arg Gly Gly Thr Glu Thr 
225 230 235 240 

Arg Thr Ala Phe Gly He Glu Phe Ala Arg Ser Glu Ala Phe Gin Lys 
245 250 255 

Gly Gly Arg Lys Gly Ala Lys Lys Val Met He Val He Thr Asp Gly 
260 265 270 

Glu Ser His Asp Ser Pro Asp Leu Glu Lys Val He Gin Gin Ser Glu 
275 280 285 

Arg Asp Asn Val Thr Arg Tyr Ala Val Ala Val Leu Gly Tyr Tyr Asn 
290 295 300 

Arg Arg Gly He Asn Pro Glu Thr Phe Leu Asn Glu He Lys Tyr He 
305 310 315 320 

Ala Ser Asp Pro Asp Asp Lys His Phe Phe Asn Val Thr Asp Glu Ala 
325 330 335 

Ala Leu Lys Asp He Val Asp Ala Leu Gly Asp Arg He Phe Ser Leu 
340 345 350 

Glu Gly Thr Asn Lys Asn Glu Thr Ser Phe Gly Leu Glu Met Ser Gin 
355 360 365 

Thr Gly Phe Ser Ser His Val Val Glu Asp Gly Val Leu Leu Gly Ala 
370 375 380 

Val Gly Ala Tyr Asp Trp Asn Gly Ala Val Leu Lys Glu Thr Ser Ala 
385 390 395 400 

Gly Lys Val He Pro Leu Arg Glu Ser Tyr Leu Lys Glu Phe Pro Glu 
405 410 415 

Glu Leu Lys Asn His Gly Ala Tyr Leu Gly Tyr Thr Val Thr Ser Val 
420 425 430 

Val Ser Ser Arg Gin Gly Arg Val Tyr Val Ala Gly Ala Pro Arg Phe 
435 440 445 

Asn His Thr Gly Lys Val He Leu Phe Thr Met His Asn Asn Arg Ser 
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450 455 4 6 o 

Leu Thr lie His Gin Ala Met Arg Gly Gin Gin He Gly Ser Tyr Phe 
465 470 475 480 

Gly Ser Glu He Thr Ser Val Asp He Asp Gly Asp Gly Val Thr Asp 
485 490 495 

Val Leu Leu Val Gly Ala Pro Met Tyr Phe Asn Glu Gly Arg Glu Arg 
500 505 510 

Gly Lys Val Tyr Val Tyr Glu Leu Arg Gin Asn Arg Phe Val Tyr Asn 
515 520 525 

Gly Thr Leu Lys Asp Ser His Ser Tyr Gin Asn Ala Arg Phe Gly Ser 
530 535 540 

Ser He Ala Ser Val Arg Asp Leu Asn Gin Asp Ser Tyr Asn Asp Val 
545 550 555 560 

Val Val Gly Ala Pro Leu Glu Asp Asn His Ala Gly Ala He Tyr lie 
565 570 575 

Phe His Gly Phe Arg Gly Ser He Leu Lys Thr Pro Lys Gin Arg He 
580 585 590 

Thr Ala Ser Glu Leu Ala Thr Gly Leu Gin Tyr Phe Gly Cys Ser lie 
595 600 605 

His Gly Gin Leu Asp Leu Asn Glu Asp Gly Leu He Asp Leu Ala Val 
610 615 620 

Gly Ala Leu Gly Asn Ala Val He Leu Trp Ser Arg Pro Val Val Gin 
625 630 635 640 

He Asn Ala Ser Leu His Phe Glu Pro Ser Lys lie Asn He Phe His 
645 650 655 

Arg Asp Cys Lys Arg Ser Gly Arg Asp Ala Thr Cys Leu Ala Ala Phe 
660 665 670 

Leu Cys Phe Thr Pro He Phe Leu Ala Pro His Phe Gin Thr Thr Thr 
675 680 685 

Val Gly He Arg Tyr Asn Ala Thr Met Asp Glu Arg Arg Tyr Thr Pro 
690 695 700 

Arg Ala His Leu Asp Glu Gly Gly Asp Arg Phe Thr Asn Arg Ala Val 
705 710 715 720 

Leu Leu Ser Ser Gly Gin Glu Leu Cys Glu Arg He Asn Phe His Val 
725 730 735 

Leu Asp Thr Ala Asp Tyr Val Lys Pro Val Thr Phe Ser Val Glu Tyr 
740 745 750 

Ser Leu Glu Asp Pro Asp His Gly Pro Met Leu Asp Asp Gly Trp Pro 
755 760 765 
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Thr Thr Leu Arg Val Ser Val Pro Phe Trp Asn Gly Cys Asn Glu Asp 
770 775 780 

Glu His Cys Val Pro Asp Leu Val Leu Asp Ala Arg Ser Asp Leu Pro 
785 790 795 800 

Thr Ala Met Glu Tyr Cys Gin Arg Val Leu Arg Lys Pro Ala Gin Asp 
805 ^ 810 815 

Cys Ser Ala Tyr Thr Leu Ser Phe Asp Thr Thr Val Phe He He Glu 
820 825 830 

Ser Thr Arg Gin Arg Val Ala Val Glu Ala Thr Leu Glu Asn Arg Gly 
835 840 845 

Glu Asn Ala Tyr Ser Thr Val Leu Asn He Ser Gin Ser Ala Asn Leu 
850 855 860 

Gin Phe Ala Ser Leu He Gin Lys Glu Asp Ser Asp Gly Ser He Glu 
865 870 875 880 

Cys Val Asn Glu Glu Arg Arg Leu Gin Lys Gin Val Cys Asn Val Ser 
885 890 895 

Tyr Pro Phe Phe Arg Ala Lys Ala Lys Val Ala Phe Arg Leu Asp Phe 
900 905 910 

Glu Phe Ser Lys Ser He Phe Leu His His Leu Glu He Glu Leu Ala 
915 920 925 

Ala Gly Ser Asp Ser Asn Glu Arg Asp Ser Thr Lys Glu Asp Asn Val 
930 935 940 

Ala Pro Leu Arg Phe His Leu Lys Tyr Glu Ala Asp Val Leu Phe Thr 
945 950 955 960 

. Arg Ser Ser Ser Leu Ser His Tyr Glu Val Lys Leu Asn Ser Ser Leu 
965 970 975 

.Glu. Arg Tyr Asp Gly He Gly Pro Pro Phe Ser Cys He Phe Arg He 
980 985 990 

Gin Asn Leu Gly Leu Phe Pro He His Gly He Met Met Lys He Thr 
995 1000 1005 

He Pro He Ala Thr Arg Ser Gly Asn Arg Leu Leu Lys Leu Arg Asp 
1010 1015 1020 

Phe Leu Thr Asp Glu Val Ala Asn Thr Ser Cys Asn He Trp Gly Asn 
1030 1035 1040 

Ser Thr Glu Tyr Arg Pro Thr Pro Val Glu Glu Asp Leu Arg Arg Ala 
1045 1050 1055 

Pro Gin Leu Asn His Ser Asn Ser Asp Val Val Ser He Asn Cys Asn 
1060 1065 1070 
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lie Arg Leu Val Pro Asn Gin Glu He Asn Phe His Leu Leu Gly Asn 
107 5 1080 1085 

Leu Trp Leu Arg Ser Leu Lys Ala Leu Lys Tyr Lys Ser Met Lys He 
1090 1095 noo 

Met Val Asn Ala Ala Leu Gin Arg Gin Phe His Ser Pro Phe He Phe 
1H0 1115 H20 

Arg Glu Glu Asp Pro Ser Arg Gin He Val Phe Glu He Ser Lys Gin 
1125 H30 H35 

Glu Asp Trp Gin Val Pro He Trp He He Val Gly Ser Thr Leu Gly 
1140 H45 H50 

Gly Leu Leu Leu Leu Ala Leu Leu Val Leu Ala Leu Trp Lys Leu Gly 
1155 H60 H65 

Phe Phe Arg Ser Ala Arg Arg Arg Arg Glu Pro Gly Leu Asp Pro Thr 
H70 1175 H80 

Pro Lys Val Leu Glu 



<210> 36 

<211> 196 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Met Ser Arg Arg Ser Met Leu Leu Ala Trp Ala Leu Pro Ser Leu Leu 
15 10 15 

Arg Leu Gly Ala Ala Gin Glu Thr Glu Asp Pro Ala Cys Cys Ser Pro 
20 25 30 

He Val Pro Arg Asn Glu Trp Lys Ala Leu Ala Ser Glu Cys Ala Gin 
35 40 45 

His Leu Ser Leu Pro Leu Arg Tyr Val Val Val Ser His Thr Ala Gly 
50 55 60 

Ser Ser Cys Asn Thr Pro Ala Ser Cys Gin Gin Gin Ala Arg Asn Val 
65 70 75 80 

Gin His Tyr His Met Lys Thr Leu Gly Trp Cys Asp Val Gly Tyr Asn 
85 90 95 

Phe Leu He Gly Glu Asp Gly Leu Val Tyr Glu Gly Arg Gly Trp Asn 
100 105 no 

Phe Thr Gly Ala His Ser Gly His Leu Trp Asn Pro Met Ser He Gly 
H5 120 125 

He Ser Phe Met Gly Asn Tyr Met Asp Arg Val Pro Thr Pro Gin Ala 
230 135 140 
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lie Arg Ala Ala Gin Gly Leu Leu Ala Cys Gly Val Ala Gin Gly Ala 

145 150 155 160 

Leu Arg Ser Asn Tyr Val Leu Lys Gly His Arg Asp Val Gin Arg Thr 
165 170 175 

Leu Ser Pro Gly Asn Gin Leu Tyr His Leu lie Gin Asn Trp Pro His 
180 185 190 

Tyr Arg Ser Pro 
195 



<210> 37 
<211> 319 
* <212> PRT 
<213> Homo sapiens 

* <220> 
<221> SITE 
<222> (319) 

<223> Xaa equals stop translation 
<400> 37 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
1 5 io 15 

Gly Gin Trp Gin Val Phe Gly Pro Asp Lys Pro Val Gin Ala Leu Val 
20 25 30 

Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys Thr Asn Ala 
35 40 45 

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gin Phe Ser Ser Val Val 
50 55 60 

His Leu Tyr Arg Asp Gly Lys Asp Gin Pro Phe Met Gin Met Pro Gin 
65 70 75 80 

Tyr Gin Gly Arg Thr Lys Leu Val Lys Asp Ser lie Ala Glu Gly Arg 
85 90 95 

He Ser Leu Arg Leu Glu Asn He Thr Val Leu Asp Ala Gly Leu Tyr 
100 105 HO 

Gly Cys Arg He Ser Ser Gin Ser Tyr Tyr Gin Lys Ala He Trp Glu 
115 120 125 

Leu Gin Val Ser Ala Leu Gly Ser Val Pro Leu He Ser He Ala Gly 
130 135 140 

Tyr Val Asp Arg Asp He Gin Leu Leu Cys Gin Ser Ser Gly Trp Phe 
I 45 150 155 160 

Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gin Gly Gin Asp Leu Ser 
165 170 175 

Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu 
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180 



185 



190 



He Ser Leu Thr Val Gin Glu Asn Ala Gly Ser He Ser Cys Ser Met 
195 200 205 

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gin He Gly 
210 215 220 

Asp Trp Arg Arg Lys His Gly Gin Ala Gly Lys Arg Lys Tyr Ser Ser 
225 230 235 240 

Ser His He Tyr Asp Ser Phe Pro Ser Leu Ser Phe Met Asp Phe Tyr 
245 250 255 

He Leu Arg Pro Val Gly Pro Cys Arg Ala Lys Leu Val Met Gly Thr 
260 265 270 

Leu Lys Leu Gin He Leu Gly Glu Val His Phe Val Glu Lys Pro His 
275 280 285 

Ser Leu Leu Gin He Ser Gly Gly Ser Thr Thr Leu Lys Lys Gly Pro 
290 295 300 



Asn Pro Trp Ser Phe Pro Ser Pro Cys Ala Leu Phe Pro Thr Xaa 
305 310 315 



<210> 38 

<211> 375 

<212> PRT 

<213> Homo sapiens 

<400> 38 

Met Ala Gly He Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys 
1 5 10 15 

Ala Val Gly Gin Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr Trp 
20 25 30 

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gin Ser Thr Leu Asn Leu 
35 40 45 

Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys Leu Glu Val Ser Ser Ser 
50 55 60 

Cys Gly Pro Gin Cys His Lys Gly Thr Pro Leu Pro Thr Tyr Glu Glu 
6 * 70 75 80 

Ala Lys Gin Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg 
85 90 95 

l 

Thr Glu Thr Gin Val Gly He Tyr He Leu Ser Ser Ser Gly Asp Gly 
100 105 110 

Ala Gin His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys Arg 
115 120 125 

Gin He Tyr Gly Tyr Asp Ser Arg Phe Ser He Phe Gly Lys Asp Phe 
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130 



135 



140 



Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys 
"5 150 155 160 

Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys 
165 170 175 

lie His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gin Lys Leu Arg Val 
180 185 190 

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp 
195 200 205 

Ser Thr Ser Ala Met Pro Glu Gin Met Lys Phe Gin Trp He Arg Val 
210 215 220 

Lys Arg Thr His Val Pro Lys Gly Trp He Lys Gly Asn Ala Asn Asp 
225 230 235 240 

He Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His 
245 250 255 

Lys Arg Lys Phe Met Lys He Gly Val Ser Pro Pro Ala Lys Gin Leu 
260 265 270 

Pro Gly Gly Arg He His Phe Ser Gly Tyr Asp Asn Asp Arg Pro Gly 
275 280 285 

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu 
290 295 300 

Leu Tyr Gin Gin Cys Asp Ser Gin Pro Gly Ala Ser Gly Ser Gly Val 
305 310 315 320 

Tyr Val Arg Met Trp Lys Arg Gin His Gin Lys Trp Glu Arg Lys He 
325 330 335 

He Gly Met He Ser Gly His Gin Trp Val Asp Met Asp Gly Ser Pro 
340 345 350 

Gin Glu Phe Thr Arg Gly Cys Ser Glu He Thr Pro Leu Gin Tyr He 
355 360 365 



Pro Asp He Ser He Gly Val 
370 375 



<210> 39 
<211> 364 
<212> PRT 

<213> Homo sapiens 
<400> 39 

Met Ala Ser Val Val Leu Pro Ser Gly Ser Gin Cys Ala Ala Ala Ala 
is io 15 



Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu 
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20 25 30 

Phe Ser Ala Ala Ala Leu lie Pro Thr Gly Asp Gly Gin Asn Leu Phe 
35 40 45 

Thr Lys Asp Val Thr Val He Glu Gly Glu Val Ala Thr He Ser Cys 
50 55 60 

Gin Val Asn Lys Ser Asp Asp Ser Val He Gin Leu Leu Asn Pro Asn 
65 70 75 80 

Arg Gin Thr He Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg 
85 90 95 

Phe Gin Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr 
100 105 HO 

Asn Val Ser He Ser Asp Glu Gly Arg Tyr Phe Cys Gin Leu Tyr Thr 
115 120 125 

Asp Pro Pro Gin Glu Ser Tyr Thr Thr He Thr Val Leu Val Pro Pro 
130 135 140 

Arg Asn Leu Met He Asp He Gin Lys Asp Thr Ala Val Glu Gly Glu 
145 150 155 160 

Glu lie Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr 
165 170 175 

He Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val 
180 185 190 

Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gin Leu Met Leu Lys 
195 200 205 

Val His Lys Glu Asp Asp Gly Val Pro Val He Cys Gin Val Glu His 
210 215 220 

Pro Ala Val Thr Gly Asn Leu Gin Thr Gin Arg Tyr Leu Glu Val Gin 
225 230 235 240 

Tyr Lys Pro Gin Val His He Gin Met Thr Tyr Pro Leu Gin Gly Leu 
245 250 255 

Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala He Gly Lys 
260 265 270 

Pro Gin Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro 
275 280 285 

Gin His Ala Val Leu Ser Gly Pro Asn Leu Phe He Asn Asn Leu Asn 
290 295 300 

Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn He Val Gly 
305 310 315 320 

Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr 
325 330 335 
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lie Pro Pro Pro Thr Thr Thr Thr 
340 

Thr lie Leu Thr He He Thr Asp 
355 360 



Thr Thr Thr Thr Thr Thr Thr Thr 
345 350 

Ser Arg Ala Arg 



<210> 40 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Met Ser Ser Ser Ser Leu Lys His 
1 5 

Phe Ser Ser Phe He Ser Gly Glu 
20 

Phe Phe Leu Pro Tyr Pro Ser Ser 
35 40 

Cys Ser He Leu Asp Pro Phe Ser 
50 55 

Glu Asn 
65 



Leu Leu Cys Met Ala Leu Ser Trp 
10 15 

Thr Ser Phe Ser Leu Leu Asn Ser 
25 30 

Arg Cys Cys Cys Phe Ser Val Gin 
45 

Cys Asn Ser Met Arg Phe Pro Trp 
60 



<210> 41 

<211> 469 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Met Arg Pro Pro Gly Phe Arg Asn Phe Leu Leu Leu Ala Ser Ser Leu 
15 10 15 

Leu Phe Ala Gly Leu Ser Ala Val Pro Gin Ser Phe Ser Pro Ser Leu 
20 25 30 

Arg Ser Trp Pro Gly Ala Ala Cys Arg Leu Ser Arg Ala Glu Ser Glu 
35 40 45 

Arg Arg Cys Arg Ala Pro Gly Gin Pro Pro Gly Ala Ala Leu Cys His 
50 55 60 

Gly Arg Gly Arg Cys Asp Cys Gly Val Cys He Cys His Val Thr Glu 
65 70 75 80 

Pro Gly Met Phe Phe Gly Pro Leu Cys Glu Cys His Glu Trp Val Cys 
85 90 95 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys 
100 105 HO 

Gly Lys Cys Lys Cys Asp Gin Gly Trp Tyr Gly Asp Ala Cys Gin Tyr 
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115 120 125 

Pro Thr Asn Cys Asp Leu Thr Lys Lys Lys Ser Asn Gin Met Cys Lys 
130 135 140 

Asn Ser Gin Asp He He Cys Ser Asn Ala Gly Thr Cys His Cys Gly 
I 45 150 155 160 

Arg Cys Lys Cys Asp Asn Ser Asp Gly Ser Gly Leu Val Tyr Gly Lys 
165 170 175 

Phe Cys Glu Cys Asp Asp Arg Glu Cys He Asp Asp Glu Thr Glu Glu 
180 185 190 

He Cys Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys 
195 200 205 

Ala Gly Trp His Gly Asp Lys Cys Glu Phe Gin Cys Asp He Thr Pro 
210 215 220 

Trp Glu Ser Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys He Cys Ser 
225 230 235 240 

Ser Arg Gly Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp 
2 45 250 255 

Pro Thr Gly Asp Trp Gly Asp He His Gly Asp Thr Cys Glu Cys Asp 
260 265 270 

Glu Arg Asp Cys Arg Ala Val Tyr Asp Arg Tyr Ser Asp Asp Phe Cys 
2 75 280 285 

Ser Gly His Gly Gin Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly 
290 295 300 

Trp Tyr Gly Lys Lys Cys Glu His Pro Gin Ser Cys Thr Leu Ser Ala 
305 310 315 320 

Glu Glu Ser He Arg Lys Cys Gin Gly Ser Ser Asp Leu Pro Cys Ser 
325 330 335 

Gly Arg Gly Lys Cys Glu Cys Gly Lys Cys Thr Cys Tyr Pro Pro Gly 
340 345 350 

Asp Arg Arg Val Tyr Gly Lys Thr Cys Glu Cys Asp Asp Arg Arg Cys 
355 360 365 

Glu Asp Leu Asp Gly Val Val Cys Gly Gly His Gly Thr Cys Ser Cys 
370 375 380 

Gly Arg Cys Val Cys Glu Arg Gly Trp Phe Gly Lys Leu Cys Gin His 
385 390 395 400 

Pro Arg Lys Cys Asn Met Thr Glu Glu Gin Ser Lys Asn Leu Cys Glu 
405 410 415 

Ser Ala Asp Gly lie Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly 
420 425 430 
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Lys Cys He Cys Ser Ala Glu Glu Trp Tyr He Ser Gly Glu Phe Cys 
435 440 445 

Asp Cys Asp Asp Arg Asp Cys Asp Lys His Asp Gly Leu He Cys Thr 
4 50 455 460 

Arg Glu Trp Asn Met 
465 



<210> 42 

<211> 127 

<212> PRT 

<213> Homo sapiens 

<400> 42 

Met Leu Leu Pro Leu Leu Leu Ser Ser Leu Leu Gly Gly Ser Gin Ala 
1 5 io 15 

Met Asp Gly Arg Phe Trp He Arg Val Gin Glu Ser Val Met Val Pro 
20 25 30 

Glu Gly Leu Cys He Ser Val Pro Cys Ser Phe Ser Tyr Pro Arg Gin 
35 40 45 

Asp Trp Thr Gly Ser Thr Pro Ala Tyr Gly Tyr Trp Phe Lys Ala Val 
50 55 60 

Thr Glu Thr Thr Lys Gly Ala Pro Val Ala Thr Asn His Gin Ser Arg 
65 70 75 80 

Glu Val Glu Met Ser Thr Arg Gly Arg Phe Pro Gly Ser Leu Gly Asp 
85 90 95 

Pro Ala Lys Gly Asn Cys Ser Leu Val He Arg Arg Arg Ala Asp Ala 
100 105 HO 

Arg Met Ser His Ser Thr Ser Phe Gly Trp Arg Glu Glu Ala Met 
115 120 125 



<210> 43 

<211> 1034 

<212> PRT 

<213> Homo sapiens 

<400> 43 

Met Asp Leu Pro Arg Gly Leu Val Val Ala Trp Ala Leu Ser Leu Trp 
1 5 10 is 

Pro Gly Phe Thr Asp Thr Phe Asn Met Asp Thr Arg Lys Pro Arg Val 
20 25 30 

He Pro Gly Ser Arg Thr Ala Phe Phe Gly Tyr Thr Val Gin Gin His 
35 40 45 

Asp He Ser Gly Asn Lys Trp Leu Val Val Gly Ala Pro Leu Glu Thr 
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50 55 60 

Asn Gly Tyr Gin Lys Thr Gly Asp Val Tyr Lys Cys Pro Val He His 
65 70 75 80 

Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn Val 
85 90 95 

Ser Glu Arg Lys Asp Asn Met Arg Leu Gly Leu Ser Leu Ala Thr Asn 
100 105 HO 

Pro Lys Asp Asn Ser Phe Leu Ala Cys Ser Pro Leu Trp Ser His Glu 
115 120 125 

Cys Gly Ser Ser Tyr Tyr Thr Thr Gly Met Cys Ser Arg Val Asn Ser 
130 135 140 

Asn Phe Arg Phe Ser Lys Thr Val Ala Pro Ala Leu Gin Arg Cys Gin 
145 150 155 160 

Thr Tyr Met Asp He Val He Val Leu Asp Gly Ser Asn Ser He Tyr 
165 170 175 

Pro Trp Val Glu Val Gin His Phe Leu He Asn lie Leu Lys Lys Phe 
180 185 190 

Tyr He Gly Pro Gly Gin He Gin Val Gly Val Val Gin Tyr Gly Glu 
195 200 205 

Asp Val Val His Glu Phe His Leu Asn Asp Tyr Arg Ser Val Lys Asp 
210 215 220 

Val Val Glu Ala Ala Ser His lie Glu Gin Arg Gly Gly Thr Glu Thr 
225 230 235 240 

Arg Thr Ala Phe Gly He Glu Phe Ala Arg Ser Glu Ala Phe Gin Lys 
245 250 255 

Gly Gly Arg Lys Gly Ala Lys Lys Val Met He Val He Thr Asp Gly 
260 265 270 

Glu Ser His Asp Ser Pro Asp Leu Glu Lys Val lie Gin Gin Ser Glu 
275 280 285 

Arg Asp Asn Val Thr Arg Tyr Ala Val Ala Val Leu Gly Tyr Tyr Asn 
290 295 300 

Arg Arg Gly lie Asn Pro Glu Thr Phe Leu Asn Glu lie Lys Tyr lie 
305 310 315 320 

Ala Ser Asp Pro Asp Asp Lys His Phe Phe Asn Val Thr Asp Glu Ala 
325 330 335 

Ala Leu Lys Asp He Val Asp Ala Leu Gly Asp Arg He Phe Ser Leu 
340 345 350 

Glu Gly Thr Asn Lys Asn Glu Thr Ser Phe Gly Leu Glu Met Ser Gin 
355 360 365 
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Thr Gly Phe Ser Ser His Val Val Glu Asp Gly Val Leu Leu Gly Ala 
370 375 380 

Val Gly Ala Tyr Asp Trp Asn Gly Ala Val Leu Lys Glu Thr Ser Ala 
385 390 395' 400 

Gly Lys Val He Pro Leu Arg Glu Ser Tyr Leu Lys Glu Phe Pro Glu 
405 410 415 

Glu Leu Lys Asn His Gly Ala Tyr Leu Gly Tyr Thr Val Thr Ser Val 
420 425 430 

Val Ser Ser Arg Gin Gly Arg Val Tyr Val Ala Gly Ala Pro Arg Phe 
435 440 445 

Asn His Thr Gly Lys Val He Leu Phe Thr Met His Asn Asn Arg Ser 
450 455 460 

Leu Thr He His Gin Ala Met Arg Gly Gin Gin He Gly Ser Tyr Phe 
465 470 475 480 

Gly Ser Glu He Thr Ser Val Asp He Asp Gly Asp Gly Val Thr Asp 
485 490 495 

Val Leu Leu Val Gly Ala Pro Met Tyr Phe Asn Glu Gly Arg Glu Arg 
500 505 510 

Gly Lys Val Tyr Val Tyr Glu Leu Arg Gin Asn Arg Phe Val Tyr Asn 
515 520 525 

Gly Thr Leu Lys Asp Ser His Ser Tyr Gin Asn Ala Arg Phe Gly Ser 
530 535 540 

Ser He Ala Ser Val Arg Asp Leu Asn Gin Asp Ser Tyr Asn Asp Val 
545 550 555 560 

Val Val Gly Ala Pro Leu Glu Asp Asn His Ala Gly Ala He Tyr He 
565 570 575 

Phe His Gly Phe Arg Gly Ser He Leu Lys Thr Pro Lys Gin Arg He 
580 585 590 

Thr Ala Ser Glu Leu Ala Thr Gly Leu Gin Tyr Phe Gly Cys Ser He 
595 600 605 

His Gly Gin Leu Asp Leu Asn Glu Asp Gly Leu He Asp Leu Ala Val 
610 615 620 

Gly Ala Leu Gly Asn Ala Val He Leu Trp Ser Arg Pro Val Val Gin 
625 630 635 640 

He Asn Ala Ser Leu His Phe Glu Pro Ser Lys He Asn He Phe His 
645 650 655 

Arg Asp Cys Lys Arg Ser Gly Arg Asp Ala Thr Cys Leu Ala Ala Phe 
660 665 670 
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Leu Cys Phe Thr Pro He Phe Leu Ala Pro His Phe Gin Thr Thr Thr 
675 680 685 

Val Gly He Arg Tyr Asn Ala Thr Met Asp Glu Lys Arg Tyr Thr Pro 
690 695 700 

Arg Ala His Leu Asp Glu Gly Gly Asp Arg Phe Thr Asn Arg Ala Val 
705 710 715 720 

Leu Leu Ser Ser Gly Gin Glu Leu Cys Glu Arg He Asn Phe His Val 
725 730 735 

Leu Asp Thr Ala Asp Tyr Val Lys Pro Val Thr Phe Ser Val Glu Tyr 
740 745 750 

Ser Leu Glu Asp Pro Asp His Gly Pro Met Leu Asp Asp Gly Trp Pro 
755 760 765 

Thr Thr Leu Arg Val Ser Val Pro Phe Trp Asn Gly Cys Asn Glu Asp 
770 775 780 

Glu His Cys Val Pro Asp Leu Val Leu Asp Ala Arg Ser Asp Leu Pro 
785 790 795 800 

Thr Ala Met Glu Tyr Cys Gin Arg Val Leu Arg Lys Pro Ala Gin Asp 
805 810 815 

Cys Ser Ala Tyr Thr Leu Ser Phe Asp Thr Thr Val Phe He He Glu 
820 825 830 

Ser Thr Arg Gin Arg Val Ala Val Glu Ala Thr Leu Glu Asn Arg Gly 
835 840 845 

Glu Asn Ala Tyr Ser Thr Val Leu Asn He Ser Gin Ser Ala Asn Leu 
850 855 860 

Gin Phe Ala Ser Leu He Gin Lys Glu Asp Ser Asp Gly Ser He Glu 
865 870 875 880 

Cys Val Asn Glu Glu Arg Arg Leu Gin Lys Gin Val Cys Asn Val Ser 
885 890 895 

Tyr Pro Phe Phe Arg Ala Lys Ala Lys Val Ala Phe Arg Leu Asp Phe 
900 J 905 910 

Glu Phe Ser Lys Ser He Phe Leu His His Leu Glu lie Glu Leu Ala 
915 920 925 

Ala Gly Ser Asp Ser Asn Glu Arg Asp Ser Thr Lys < Glu Asp Asn Val 
930 935 940 

Ala Pro Leu Arg Phe His Leu Lys Tyr Glu Ala Asp Val Leu Phe Thr 
945 950 955 960 

Arg Ser Ser Ser Leu Ser His Tyr Glu Val Lys Leu Asn Ser Ser Leu 
965 970 975 



Glu Arg Tyr Asp Gly He Gly Pro Pro Phe Ser Cys He Phe Arg He 
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980 985 990 

Gin Asn Leu Gly Leu Phe Pro He His Gly He Met Met Lys He Thr 
995 1000 1005 

He Pro He Ala Thr Arg Ser Gly Asn Arg Leu Leu Lys Leu Arg Asp 
1010 1015 1020 

Phe Leu Thr Asp Glu Gly Glu His Val Leu 
1030 



<210> 44 

<211> 461 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (236) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 44 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
1 5 10 15 

Gly Gin Trp Gin Val Phe Gly Pro Asp Lys Pro Val Gin Ala Leu Val 
20 25 30 

Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys Thr Asn Ala 
35 40 45 

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gin Phe Ser Ser Val Val 
50 55 60 

His Leu Tyr Arg Asp Gly Lys Asp Gin Pro Phe Met Gin Met Pro Gin 
65 70 75 80 

Tyr Gin Gly Arg Thr Lys Leu Val Lys Asp Ser He Ala Glu Gly Arg 
85 90 95 

He Ser Leu Arg Leu Glu Asn He Thr Val Leu Asp Ala Gly Leu Tyr 
100 105 HO 

Gly Cys Arg He Ser Ser Gin Ser Tyr Tyr Gin Lys Ala He Trp Glu 
115 120 125 

Leu Gin Val Ser Ala Leu Gly Ser Val Pro Leu He Ser He Thr Gly 
130 135 140 

Tyr Val Asp Arg Asp He Gin Leu Leu Cys Gin Ser Ser Gly Trp Phe 
14 5 150 155 160 
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Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gin Gly Gin Asp Leu Ser 
165 170 175 

Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu 
180 185 190 

He Ser Leu Thr Val Gin Glu Asn Ala Gly Ser He Ser Cys Ser Met 
195 200 205 

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gin He Gly 
210 215 220 

Asp Thr Phe Phe Glu Pro He Ser Trp Xaa Leu Xaa Thr Lys Val Leu 
225 230 235 240 

Gly He Leu Cys Cys Gly Leu Phe Phe Gly lie Val Gly Leu Lys He 
245 250 255 

Phe Phe Ser Lys Phe Gin Trp Lys He Gin Ala Glu Leu Asp Trp Arg 
260 265 270 

Arg Lys His Gly Gin Ala Glu Leu Arg Asp Ala Arg Lys His Ala Val 
275 280 285 

Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys Leu Cys Val Ser 
290 295 300 

Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro Gin Glu Val Pro His 
305 310 315 320 

Ser Glu Lys Arg Phe Thr Arg Lys Ser Val Val Ala Ser Gin Ser Phe 
325 330 335 

Gin Ala Gly Lys His Tyr Trp Glu Val Asp Gly Gly His Asn Lys Arg 
340 345 350 

Trp Arg Val Gly Val Cys Arg Asp Asp Val Asp Arg Arg Lys Glu Tyr 
355 360 365 

Val Thr Leu Ser Pro Asp His Gly Tyr Trp Val Leu Arg Leu Asn Gly 
370 375 380 

Glu His Leu Tyr Phe Thr Leu Asn Pro Arg Phe He Ser Val Phe Pro 
385 390 395 400 

Arg Thr Pro Pro Thr Lys He Gly Val Phe Leu Asp Tyr Glu Cys Gly 
405 410 415 

Thr He Ser Phe Phe Asn He Asn Asp Gin Ser Leu He Tyr Thr Leu 
420 425 430 

Thr Cys Arg Phe Glu Gly Leu Leu Arg Pro Tyr He Glu Tyr Pro Ser 
435 440 445 



Tyr Asn Glu Gin Asn Gly Thr Pro Arg Asp Lys Gin Gin 
450 455 460 



WO 00/29435 

<210> 45 
<211> 383 
<212> PRT 

<213> Homo sapiens 



PCT/US99/25031 
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275 280 285 

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu 
290 295 300 

Leu Tyr Gin Gin Cys Asp Ala Gin Pro Gly Ala Ser Gly Ser Gly Val 
305 310 315 320 

Tyr Val Arg Met Trp Lys Arg Gin Gin Gin Lys Trp Glu Arg Lys lie 
325 330 335 

lie Gly lie Phe Ser Gly His Gin Trp Val Asp Met Asn Gly Ser Pro 
340 345 350 

Gin Asp Phe Asn Val Ala Val Arg He Thr Pro Leu Lys Tyr Ala Gin 
355 360 365 

He Cys Tyr Trp He Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly 
370 375 380 



<210> 46 

<211> 229 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Met He Asp He Gin Lys Asp Thr Ala Val Glu Gly Glu Glu He Glu 
1 5 10 is 

Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr He Arg Trp 
20 25 30 

Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val Glu Glu Trp 
35 40 45 

Ser Asp Met Tyr Thr Val Thr Ser Gin Leu Met Leu Lys Val His Lys 
50 55 60 

Glu Asp Asp Gly Val Pro Val He Cys Gin Val Glu His Pro Ala Val 
65 70 75 80 

Thr Gly Asn Leu Gin Thr Gin Arg Tyr Leu Glu Val Gin Tyr Lys Pro 
85 90 95 

Gin Val His He Gin Met Thr Tyr Pro Leu Gin Gly Leu Thr Arg Glu 
100 105 HO 

Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala He Gly Lys Pro Gin Pro 
115 120 125 

Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro Gin His Ala 
130 135 140 

Val Leu Ser Gly Pro Asn Leu Phe lie Asn Asn Leu Asn Lys Thr Asp 
145 150 155 160 

Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn He Val Gly Lys Ala His 
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165 170 175 

Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr He Pro Pro 
180 185 190 

Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr He Leu 
195 200 205 

Thr He He Thr Asp Ser Pro Ser Gin Val Lys Lys Ala Arg Ser Gly 
210 215 220 

Gin Trp He Met Pro 
225 



<210> 47 

<211> 724 

<212> PRT 

<213> Homo sapiens 

<400> 47 

Met Leu Arg Thr Ser Thr Pro Asn Leu Cys Gly Gly Leu His Cys Arg 
15 10 15 

Ala Pro Trp Leu Ser Ser Gly He Leu Cys Leu Cys Leu He Phe Leu 
20 25 30 

Leu Gly Gin Val Gly Leu Leu Gin Gly His Pro Gin Cys Leu Asp Tyr 
35 40 45 

Gly Pro Pro Phe Gin Pro Pro Leu His Leu Glu Phe Cys Ser Asp Tyr 
50 55 60 

Glu Ser Phe Gly Cys Cys Asp Gin His Lys Asp Arg Arg He Ala Ala 
65 70 75 80 

Arg Tyr Trp Asp He Met Glu Tyr Phe Asp Leu Lys Arg His Glu Leu 
85 90 95 

Cys Gly Asp Tyr He Lys Asp He Leu Cys Gin Glu Cys Ser Pro Tyr 
100 105 no 

Ala Ala His Leu Tyr Asp Ala Glu Asn Thr Gin Thr Pro Leu Arg Asn 
U5 120 125 

Leu Pro Gly Leu Cys Ser Asp Tyr Cys Ser Ala Phe His Ser Asn Cys 
130 135 140 

His Ser Ala He Ser Leu Leu Thr Asn Asp Arg Gly Leu Gin Glu Ser 
145 150 155 160 

His Gly Arg Asp Gly Thr Arg Phe Cys His Leu Leu Asp Leu Pro Asp 
165 170 175 

Lys Asp Tyr Cys Phe Pro Asn Val Leu Arg Asn Asp Tyr Leu Asn Arg 
180 185 190 

His Leu Gly Met Val Ala Gin Asp Pro Gin Gly Cys Leu Gin Leu Cys 
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195 200 205 

Leu Ser Glu Val Ala Asn Gly Leu Arg Asn Pro Val Ser Met Val His 
210 215 220 

Ala Gly Asp Gly Thr His Arg Phe Phe Val Ala Glu Gin Val Gly Val 
225 230 235 240 

Val Trp Val Tyr Leu Pro Asp Gly Ser Arg Leu Glu Gin Pro Phe Leu 
245 250 255 

Asp Leu Lys Asn lie Val Leu Thr Thr Pro Trp He Gly Asp Glu Arg 
260 265 270 

Gly Phe Leu Gly Leu Ala Phe His Pro Lys Phe Arg His Asn Arg Lys 
275 280 285 

Phe Tyr He Tyr Tyr Ser Cys Leu Asp Lys Lys Lys Val Glu Lys He 
290 295 300 

Arg He Ser Glu Met Lys Val Ser Arg Ala Asp Pro Asn Lys Ala Asp 
305 310 315 320 

Leu Lys Ser Glu Arg Val He Leu Glu He Glu Glu Pro Ala Ser Asn 
325 330 335 

His Asn Gly Gly Gin Leu Leu Phe Gly Leu Asp Gly Tyr Met Tyr He 
340 345 350 

Phe Thr Gly Asp Gly Gly Gin Ala Gly Asp Pro Phe Gly Leu Phe Gly 
355 360 365 

Asn Ala Gin Asn Lys Ser Ser Leu Leu Gly Lys Val Leu Arg He Asp 
370 375 380 

Val Asn Arg Ala Gly Ser His Gly Lys Arg Tyr Arg Val Pro Ser Asp 
385 390 395 400 

Asn Pro Phe Val Ser Glu Pro Gly Ala His Pro Ala He Tyr Ala Tyr 
405 410 415 

Gly He Arg Asn Met Trp Arg Cys Ala Val Asp Arg Gly Asp Pro He 
420 425 430 

Thr Arg Gin Gly Arg Gly Arg He Phe Cys Gly Asp Val Gly Gin Asn 
435 440 445 

Arg Phe Glu Glu Val Asp Leu He Leu Lys Gly Gly Asn Tyr Gly Trp 
450 455 460 

Arg Ala Lys Glu Gly Phe Ala Cys Tyr Asp Lys Lys Leu Cys His Asn 
465 470 475 480 

Ala Ser Leu Asp Asp Val Leu Pro He Tyr Ala Tyr Gly His Ala Val 
485 490 495 

Gly Lys Ser Val Thr Gly Gly Tyr Val Tyr Arg Gly Cys Glu Ser Pro 
500 505 510 
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Asn Leu Asn Gly Leu Tyr lie Phe Gly Asp Phe Met Ser Gly Arg Leu 
515 520 525 

Met Ala Leu Gin Glu Asp Arg Lys Asn Lys Lys Trp Lys Lys Gin Asp 
530 535 540 

Leu Cys Leu Gly Ser Thr Thr Ser Cys Ala Phe Pro Gly Leu He Ser 
545 550 555 560 

Thr His Ser Lys Phe He He Ser Phe Ala Glu Asp Glu Ala Gly Glu 
565 570 575 

Leu Tyr Phe Leu Ala Thr Ser Tyr Pro Ser Ala Tyr Ala Pro Arg Gly 
580 585 590 

Ser He Tyr Lys Phe Val Asp Pro Ser Arg Arg Ala Pro Pro Gly Lys 
595 600 605 

Cys Lys Tyr Lys Pro Val Pro Val Arg Thr Lys Ser Lys Arg He Pro 
610 615 620 

Phe Arg Pro Leu Ala Lys Thr Val Leu Asp Leu Leu Lys Glu Gin Ser 
625 630 635 640 

Glu Lys Ala Ala Arg Lys Ser Ser Ser Ala Thr Leu Ala Ser Gly Pro 
645 650 655 

Ala Gin Gly Leu Ser Glu Lys Gly Ser Ser Lys Lys Leu Ala Ser Pro 
660 665 670 

Thr Ser Ser Lys Asn Thr Leu Arg Gly Pro Gly Thr Lys Lys Lys Ala 
675 680 685 

Arg Val Gly Pro His Val Arg Gin Gly Lys Arg Arg Lys Ser Leu Lys 
690 695 700 

Ser His Ser Gly Arg Met Arg Pro Ser Ala Glu Gin Lys Arg Ala Gly 
705 710 715 720 



Arg Ser Leu Pro 



<210> 48 

<211> 995 

<212> PRT 

<213> Homo sapiens 

<400> 48 

Met Pro Lys Pro Thr Pro Asn Ser 
1 5 

Gly Cys Arg Thr Gly Met His Met 
20 



Glu Arg Val Ser Val Arg Phe Pro 
10 15 

He Ser Val Ser Leu Arg Leu Val 
25 30 



Phe Cys Ser Phe He Phe Lys Ala Gly Val Leu Leu Gly His Pro Gin 
35 40 45 
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Cys Leu Asp Tyr Gly Pro Pro Phe Lys Pro Leu Val His Leu Glu Phe 
50 55 60 

Cys Ser Glu Tyr Glu Thr Phe Gly Cys Cys Asp Gin Asp Arg Asp Asn 
65 70 75 80 

Val He Ala Glu Lys Tyr Trp Ser He Met Asp Tyr Phe Asp Leu Asn 
85 90 95 

Asn Tyr His He Cys Gly Gly Tyr He Lys Asp He Leu Cys Gin Glu 
100 105 no 

Cys Ser Pro Tyr Ala Ala His Leu Tyr Asp Ala Glu Asp Pro His Thr 
115 120 125 

Pro Leu Arg Val He Pro Gly Leu Cys Phe Asn Tyr Cys Ser Glu Phe 
130 135 140 

His Leu Lys Cys Gin Asn Ser He Thr Leu Leu Thr Glu Asp Lys Gin 
145 150 155 160 

He Arg Glu Ser Cys Asp Lys Gly Arg Asp Leu Phe Cys Ser Leu Leu 
165 170 175 

Asn Leu Pro Asp Glu Asp Tyr Cys Phe Pro Asn Val Leu His Asn Thr 
180 185 190 

Glu Leu Asn Asn Asn Leu Gly Ser Val Val Glu Asp Pro Glu Gly Cys 
195 200 205 

He Lys Leu Cys Leu He Glu Val Ala Asn Gly Leu Arg Asn Pro Val 
210 215 220 

Leu Met Leu His Ala Asn Asp Gly Thr His Arg Met Phe Val Ala Glu 
225 230 235 240 

Gin He Gly Phe Val Trp Val Tyr Leu Pro Asp Gly Ser. Arg Leu Tyr 
245 250 255 

Glu Pro Phe Leu Asn Leu Arg Arg Thr Val Leu Ala Thr Pro Trp Leu 
260 265 270 

Gly Asp Glu Arg Gly Leu Leu Gly Met Ala Phe His Pro Lys Tyr Gin 
275 280 285 

Asn Asn Arg Lys Phe Tyr Val Tyr Tyr Ser He Met Asp Glu Tyr Arg 
290 295 300 

Asn Glu Lys He Arg He Ser Glu Phe Gin Val Glu Glu His Asp He 
305 310 315 . 320 

Asn Lys Ala Asp Pro Tyr Ser Glu Arg Arg He Leu Glu He Glu Glu 
325 330 335 

Pro Ala Ala Asn His Asn Gly Gly Gin lie Leu Phe Gly Lys Asp Gly 
340 345 350 
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Tyr Leu Tyr lie Phe Thr Gly Asp Gly Gly Lys Ala Gly Asp Pro Phe 
355 360 365 

Gly Arg Phe Gly Asn Ala Gin Asn Lys Ser Val Leu Leu Gly Lys Val 
370 375 380 

Leu Arg He Asp Val Asp Gly Arg Arg Ala Asn Gly Lys Pro Tyr Gly 
385 3 90 395 400 

He Pro Ser Asp Asn Pro Phe Leu Ser Glu Arg Gly Ala Ala Pro Glu 
405 410 415 

Val His Ala Tyr Gly Val Arg Asn Met Trp Arg Cys Ser Val Asp Gin 
420 425 430 

Gly Asp Pro Val Thr Gly Arg Gly Lys Gly Arg He Phe Cys Gly Asp 
435 440 445 

Val Gly Gin Asn Arg Phe Gly Glu Asp Asp He He Val He Gly Gly 
450 455 460 

Asn Tyr Gly Trp Arg Ala Lys Glu Gly Phe Glu Cys Phe Asp Leu Lys 
465 470 4 75 480 

Leu Cys Gin Asn Ser Ser Leu Asp Asp He Leu Pro He Phe Ala Tyr 
485 490 495 

Gly His Gin Val Gly Lys Ser Val Thr Gly Gly Tyr Val Tyr Arg Gly 
500 505 sio 

Cys Glu Ser Pro Asn Leu Asn Gly Val Tyr He Phe Gly Asp Phe Met 
515 520 525 

Asn Gly Arg Leu Met Ala Leu Gin Glu Asp Gly Val Thr Gly Thr Trp 
530 535 540 

Lys Lys Gin Asp He Cys Met Gly Asp Ser Thr He Cys Ala Phe Pro 
545 550 555 560 

Arg Leu He Asn Lys Tyr Ser Lys Phe lie lie Ser Phe Gly Glu Asp 
565 570 575 

Glu Ala Gly Glu Leu Leu Phe Leu Ser Thr Ser Gin Ala Ser Ala Tyr 
580 585 590 

Ser Pro Gin Gly Ser lie Tyr Lys Leu Val Asp Pro Ser Arg Arg Ala 
595 600 605 

Ala Pro Gly Lys Cys Lys Tyr Lys Pro Val Pro Val Lys Thr Arg Ser 
610 615 620 

Lys Leu Val Pro Phe He Pro Lys Glu Lys Thr Val Leu Glu He Val 
625 630 635 6 40 

Asn Glu Ser Val Lys Pro Thr Lys Ala Pro Arg Lys Lys Thr Pro Thr 
645 650 655 

Lys Phe Pro Thr Lys Val Pro Pro Thr Pro Thr Lys Phe Pro Thr Lys 
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660 665 670 

Val Pro Pro Thr Pro Thr Gin Phe Pro Thr Lys Val Pro Pro He Pro 
675 680 685 

Thr Lys Val Pro Ser Lys Val Pro Pro Thr Pro Thr Gin Phe Pro Thr 
690 695 700 

Lys Val Pro Pro Thr Pro Thr Lys Val Ser Thr Lys Val Leu Ser Thr 
705 710 715 720 

Pro Thr He Ala His Thr Lys Val Ser Pro Thr Ser Thr Lys Leu Pro 
725 730 735 

Ser Lys Ala Pro Ser Thr Gin Thr Met Val Pro Thr Lys Val His Pro 
740 745 750 

Thr Pro Thr Lys Leu Pro Thr Lys Val Pro Pro He Thr Thr Lys Val 
755 760 765 

Ser Asn Lys Val Leu Leu Thr Ser Pro Glu Leu Pro Thr Lys Val Pro 
770 775 780 

Pro Thr Pro Thr Lys Leu Pro Thr Asn Ala Pro Pro Thr Ser He Leu 
785 790 795 800 

Leu Ser Pro Thr Pro He Lys Leu Pro Thr Lys He Ser Leu Thr Leu 
805 810 815 

Thr Ser Val Pro He Lys Asn Gin Leu Thr Ser Ala Lys Leu Leu Thr 
820 825 830 

Thr Thr Leu Pro He Ser Thr Lys Arg Ala Thr Lys Leu Pro Ser Thr 
835 840 845 

Ser Thr Ser Val Pro Ser Asn Thr Ser Cys He Leu Thr His Val Gin 
850 855 860 

Pro Lys Met Leu Pro Thr Glu Thr Arg Val Pro Asn Lys Met Pro Pro 
865 870 875 880 

Lys Pro Thr Arg He Pro Thr Met Ser Met Tyr He Thr Lys Lys Pro 
885 890 895 

Pro Leu Lys Lys Asn Ser Ala Lys Lys Val Thr Asp Lys Arg Pro Thr 
900 905 910 

Lys Ser Pro Lys Thr Thr Lys Pro Pro Lys Pro Pro Lys Ser Lys Thr 
915 920 925 

Ser Val Val Asn Gin Pro Lys Lys Lys Glu Thr Lys Thr Gly Val Asn 
930 935 940 

Asn Lys Thr Lys Asn Leu Pro Pro Lys Ala Lys Glu Pro Lys Lys Glu 
945 950 955 960 

Lys Lys Thr lie Lys Val Lys Gin Pro Val Ser His Tyr Phe Pro Pro 
965 970 975 
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Gin Lys Pro Lys Lys Gin Lys lie Lys Lys Met Gin Lys Glu Gly Asn 
980 985 990 

Glu Lys Ser 
995 



<210> 49 

<211> 700 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Met Leu Lys Met Leu Ser Phe Lys Leu Leu Leu Leu Ala Val Ala Leu 
1 5 io 15 

Gly Phe Phe Glu Gly Asp Ala Lys Phe Gly Glu Arg Ser Glu Gly Ser 
20 25 30 

Gly Ala Arg Arg Arg Arg Cys Leu Asn Gly Asn Pro Pro Lys Arg Leu 
35 40 45 

Lys Arg Arg Asp Arg Arg Val Met Ser Gin Leu Glu Leu Leu Ser Glv 
50 55 60 

Gly Glu He Leu Cys Gly Gly Phe Tyr Pro Arg Val Ser Cys Cys Leu 
65 70 75 80 

Gin Ser Asp Ser Pro Gly Leu Gly Arg Leu Glu Asn Lys He Phe Ser 
85 90 95 

Ala Thr Asn Asn Ser Glu Cys Ser Arg Leu Leu Glu Glu He Gin Cys 
100 105 no 

Ala Pro Cys Ser Pro His Ser Gin Ser Leu Phe Tyr Thr Pro Glu Arg 
H5 120 125 

r Asp Val Leu Asp Gly Asp Leu Ala Leu Pro Leu Leu Cys Lys Asp Tyr 
130 135 i 40 

* Cys Lys Glu Phe Phe Tyr Thr Cys Arg Gly His He Pro Gly Leu Leu 
145 150 155 160 

Gin Thr Thr Ala Asp Glu Phe Cys Phe Tyr Tyr Ala Arg Lys Asp Ala 
165 170 175 

Gly Leu Cys Phe Pro Asp Phe Pro Arg Lys Gin Val Arg Gly Pro Ala 
180 185 190 

Ser Asn Tyr Leu Gly Gin Met Glu Asp Tyr Glu Lys Val Gly Gly He 
195 200 205 

Ser Arg Lys His Lys His Asn Cys Leu Cys Val Gin Glu Val Met Ser 
210 215 220 

Gly Leu Arg Gin Pro Val Ser Ala Val His Ser Gly Asp Gly Ser His 
225 2 30 235 240 
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Arg Leu Phe lie Leu Glu Lys Glu Gly Tyr Val Lys He Leu Thr Pro 
245 250 255 

Glu Gly Glu Leu Phe Lys Glu Pro Tyr Leu Asp He His Lys Leu Val 
260 265 270 

Gin Ser Gly He Lys Gly Gly Asp Glu Arg Gly Leu Leu Ser Leu Ala 
275 280 . 285 

Phe His Pro Asn Tyr Lys Lys Asn Gly Lys Leu Tyr Val Ser Tyr Thr 
290 295 300 

Thr Asn Gin Glu Arg Trp Ala He Gly Pro His Asp His He Leu Arg 
305 310 315 320 

Val Val Glu Tyr Thr Val Ser Arg Lys Asn Pro His Gin Val Asp Val 
325 330 335 

Arg Thr Ala Arg Val Phe Leu Glu Val Ala Glu Leu His Arg Lys His 
340 345 350 

Leu Gly Gly Gin Leu Leu Phe Gly Pro Asp Gly Phe Leu Tyr He He 
. 355 360 365 

Leu Gly Asp Gly Met He Thr Leu Asp Asp Met Glu Glu Met Asp Gly 
37t > 375 380 

Leu Ser Asp Phe Thr Gly Ser Val Leu Arg Leu Asp Val Asp Thr Asp 
385 3 90 395 400 

Met Cys Asn Val Pro Tyr Ser He Pro Arg Ser Asn Pro His Phe Asn 
405 410 415 

Ser Thr Asn Gin Pro Pro Glu Val Phe Ala His Gly Leu His Asp Pro 
420 425 430 

Gly Arg Cys Ala Val Asp Arg His Pro Thr Asp He Asn He Asn Leu 
43 5 440 445 

Thr He Leu Cys Ser Asp Ser Asn Gly Lys Asn Arg Ser Ser Ala Arg 
450 455 460 

He Leu Gin He He Lys Gly Arg Asp Tyr Glu Ser Glu Pro Ser Leu 
465 47 0 475 480 

Leu Glu Phe Lys Pro Phe Ser Asn Gly Pro Leu Val Gly Gly Phe Val 
485 490 495 

Tyr Arg Gly Cys Gin Ser Glu Arg Leu Tyr Gly Ser Tyr Val Phe Gly 
500 505 510 

Asp Arg Asn Gly Asn Phe Leu Thr Leu Gin Gin Ser Pro Val Thr Lys 
515 520 525 

Gin Trp Gin Glu Lys Pro Leu Cys Leu Gly Ala Ser Ser Ser Cys Arg 
530 535 540 
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Gly Tyr Phe Ser Gly His lie Leu Gly Phe Gly Glu Asp Glu Leu Gly 
545 550 555 560 

Glu Val Tyr He Leu Ser Ser Ser Lys Ser Met Thr Gin Thr His Asn 
565 570 575 

Gly Lys Leu Tyr Lys He Val Asp Pro Lys Arg Pro Leu Met Pro Glu 
580 585 590 

Glu Cys Arg Val Thr Val Gin Pro Ala Gin Pro Leu Thr Ser Asp Cys 
595 600 605 

Ser Arg Leu Cys Arg Asn Gly Tyr Tyr Thr Pro Thr Gly Lys Cys Cys 
610 615 620 

Cys Ser Pro Gly Trp Glu Gly Asp Phe Cys Arg He Ala Lys Cys Glu 
625 630 635 640 

Pro Ala Cys Arg His Gly Gly Val Cys Val Arg Pro Asn Lys Cys Leu 
645 650 655 

Cys Lys Lys Gly Tyr Leu Gly Pro Gin Cys Glu Gin Val Asp Arg Asn 
660 665 670 

Val Arg Arg Val Thr Arg Ala Gly He Leu Asp Gin He He Asp Met 
675 680 685 

Thr Ser Tyr Leu Leu Asp Leu Thr Ser Tyr He Val 
690 695 700 



<210> 50 
<211> 567 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Thr Ser Thr Pro Pro Arg Ala Val Pro Leu Pro Lys Ser Ser Gin Ala 
1 5 10 15 

Ala His Gin Arg Asn Cys Asn Ser Gly Trp Ser Pro Gly Pro Ala Ser 
20 25 30 

Leu Gly Val Arg Gly Ser Val Cys Pro Ala He Cys Trp Trp His Leu 
35 40 45 

Ser Leu Leu Pro Pro Pro Ser Val Asn Pro Thr Leu Gin Lys Cys Ser 
50 55 60 

Ser Pro Gly Ala Ala Gin Glu Leu Ser Met Arg Pro Pro Gly Phe Arg 
65 70 75 80 

Asn Phe Leu Leu Leu Ala Ser Ser Leu Leu Phe Ala Gly Leu Ser Ala 
85 90 95 



Val Pro Gin Ser 
100 



Phe Ser Pro Ser Leu Arg Ser Trp Pro Gly Ala Ala 
105 HO 
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Cys Arg Leu Ser Arg Ala Glu Ser Glu Arg Arg Cys Arg Ala Pro Gly 
115 120 125 

Gin Pro Pro Gly Ala Ala Leu Cys His Gly Arg Gly Arg Cys Asp Cys 
130 135 140 

Gly Val Cys He Cys His Val Thr Glu Pro Gly Met Phe Phe Gly Pro 
145 150 155 160 

Leu Cys Glu Cys His Glu Trp Val Cys Glu Thr Tyr Asp Gly Ser Thr 
165 170 175 

Cys Ala Gly His Gly Lys Cys Asp Cys Gly Lys Cys Lys Cys Asp Gin 
180 185 190 

Gly Trp Tyr Gly Asp Ala Cys Gin Tyr Pro Thr Asn Cys Asp Leu Thr 
195 200 205 

Lys Lys Lys Ser Asn Gin Met Cys Lys Asn Ser Gin Asp He He Cys 
210 215 220 

Ser Asn Ala Gly Thr Cys His Cys Gly Arg Cys Lys Cys Asp Asn Ser 
225 230 235 240 

Asp Gly Ser Gly Leu Val Tyr Gly Lys Phe Cys Glu Cys Asp Asp Arg 
245 250 255 

Glu Cys He Asp Asp Glu Thr Glu Glu He Cys Gly Gly His Gly Lys 
260 265 270 

Cys Tyr Cys Gly Asn Cys Tyr Cys Lys Ala Gly Trp His Gly Asp Lys 
275 280 285 

Cys Glu Phe Gin Cys Asp He Thr Pro Trp Glu Ser Lys Arg Arg Cys 
290 295 300 

Thr Ser Pro Asp Gly Lys He Cys Ser Asn Arg Gly Thr Cys Val Cys 
305 310 315 320 

Gly Glu Cys Thr Cys His Asp Val Asp Pro Thr Gly Asp Trp Gly Asp 
325 330 335 

He His Gly Asp Thr Cys Glu Cys Asp Glu Arg Asp Cys Arg Ala Val 
340 345 350 

Tyr Asp Arg Tyr Ser Asp Asp Phe Cys Ser Gly His Gly Gin Cys Asn 
355 360 365 

Cys Gly Arg Cys Asp Cys Lys Ala Gly Trp Tyr Gly Lys Lys Cys Glu 
370 375 380 

His Pro Gin Ser Cys Thr Leu Ser Ala Glu Glu Ser lie Arg Lys Cys 
385 390 395 400 

Gin Gly Ser Ser Asp Leu Pro Cys Ser Gly Arg Gly Lys Cys Glu Cys 
405 410 415 

Gly Lys Cys Thr Cys Tyr Pro Pro Gly Asp Arg Arg Val Tyr Gly Lys 
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420 425 43Q 

Thr Cys Glu Cys Asp Asp Arg Arg Cys Glu Asp Leu Asp Gly Val Val 
435 440 445 

Cys Gly Gly His Gly Thr Cys Ser Cys Gly Arg Cys Val Cys Glu Arg 
4 $0 455 460 

Gly Trp Phe Gly Lys Leu Cys Gin His Pro Arg Lys Cys Asn Met Thr 
465 470 475 480 

Glu Glu Gin Ser Lys Asn Leu Cys Glu Ser Ala Asp Gly He Leu Cys 
485 490 495 

Ser Gly Lys Gly Ser Cys His Cys Gly Lys Cys He Cys Ser Ala Glu 
500 505 510 

Glu Trp Tyr He Ser Gly Glu Phe Cys Asp Cys Asp Asp Arg Asp Cys 
515 520 525 

Asp Lys His Asp Gly Leu He Cys Thr Gly Asn Gly He Cys Ser Cys 
530 535 540 

Gly Asn Cys Glu Cys Trp Asp Gly Trp Asn Gly Asn Ala Cys Glu He 
545 550 555 560 

Trp Leu Gly Ser Glu Tyr Pro 
565 



<210> 51 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Gly Lys Cys Asp Cys Gly Lys Cys Lys Cys Asp Gin Gly Trp Tyr Gly 
1 5 10 15 

Asp Ala Cys Gin Tyr Pro Thr Asn Cys Asp Leu Thr Lys 
20 25 



<210> 52 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Gly Gly His Gly Lys Cys Tyr Cys 
1 5 

Trp His Gly Asp Lys Cys Glu Phe 
20 



Gly Asn Cys Tyr Cys Lys Ala Gly 
10 15 

Gin Cys Asp He Thr 
25 



<210> 53 
<211> 29 
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<212> PRT 

<213> Homo sapiens 

<400> 53 

His Gly Gin Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly Trp Tyr 
1 5 10 15 

Gly Lys Lys Cys Glu His Pro Gin Ser Cys Thr Leu Ser 
20 25 



<210> 54 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 54 

His Gly Thr Cys Ser Cys Gly Arg Cys Val Cys Glu Arg Gly Trp Phe 
1 5 io 15 

Gly Lys Leu Cys Gin His Pro Arg Lys Cys Asn Met Thr 
20 25 



<210> 55 

<211> 29 

<212> PRT 

<213> Homo sapiens 



<400> 55 



Gly Asn Gly He Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp 
1 5 10 15 

Asn Gly Asn Ala Cys Glu lie Trp Leu Gly Ser Glu Tyr 
20 25 



<210> 56 

<211> 29 

<212> PRT 

<213> Homo sapiens 



<400> 56 

Gly Asn Gly He Cy S Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp 
1 5 io 15 

Asn Gly Asn Ala Cys Glu lie Trp Leu Gly Ser Glu Tyr 
20 25 



<210> 57 

<211> 29 

<212> PRT 

<213> Homo sapiens 



<400> 57 

Gly Asn Gly He Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp 
1 5 io 15 
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Asn Gly Asn Ala Cys Glu He Trp Leu Gly Ser Glu Tyr 
20 25 



<210> 58 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 58 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys 
1 5 10 15 

Gly Lys Cys Lys Cys Asp Gin Gly Trp Tyr Gly Asp Ala Cys Gin Tyr 
20 25 30 

Pro 



<210> 59 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Met Cys Lys Asn Ser Gin Asp He 
1 5 

His Cys Gly Arg Cys Lys Cys Asp 
20 

Tyr Gly 



He Cys Ser Asn Ala Gly Thr Cys 
10 is 

Asn Ser Asp Gly Ser Gly Leu Val 
25 30 



<210> 60 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 60 

He Asp Asp Glu Thr Glu Glu He 
1 5 

Cys Gly Asn Cys Tyr Cys Lys Ala 
20 



Cys Gly Gly His Gly Lys Cys Tyr 
10 15 

Gly Trp His Gly Asp Lys Cys 
25 30 



<210> 61 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 61 



Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys lie Cys Ser Asn Arg Gly 
1 5 io 15 
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Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp Pro Thr Gly 
20 25 30 



Asp Trp 



<210> 62 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 62 

Asp Arg Tyr Ser Asp Asp Phe Cys 
1 5 

Gly Arg Cys Asp Cys Lys Ala Gly 
20 

Pro Gin 



Ser Gly His Gly Gin Cys Asn Cys 
10 15 

Trp Tyr Gly Lys Lys Cys Glu His 
25 30 



<210> 63 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 63 

Cys Gin Gly Ser Ser Asp Leu Pro 
1 5 

Cys Gly Lys Cys Thr Cys Tyr Pro 
20 

Lys 



Cys Ser Gly Arg Gly Lys Cys Glu 
10 15 

Pro Gly Asp Arg Arg Val Tyr Gly 
25 30 



<210> 64 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 64 

Cys Glu Asp Leu Asp Gly Val Val 
1 5 

Cys Gly Arg Cys Val Cys Glu Arg 
20 



Cys Gly Gly His Gly Thr Cys Ser 
10 15 

Gly Trp Phe Gly Lys Leu Cys 
25 30 



<210> 65 

<211> 32 

<212> PRT 

<213> Homo sapiens 
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<400> 65 

Ser Ala Asp Gly lie Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly 
15 10 15 

Lys Cys lie Cys Ser Ala Glu Glu Trp Tyr He Ser Gly Glu Phe Cys 
20 25 30 



<210> 66 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 66 

Cys Asp Lys His Asp Gly Leu He Cys Thr Gly Asn Gly He Cys Ser 
1 5 io 15 

Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp Asn Gly Asn Ala Cys Glu 
20 25 30 

He 



<210> 67 

<211> 769 

<212> PRT 

<213> Homo sapiens 

<400> 67 

Met Cys Gly Ser Ala Leu Ala Phe Phe Thr Ala Ala Phe Val Cys Leu 
1 5 io 15 

Gin Asn Asp Arg Arg Gly Pro Ala Ser Phe Leu Trp Ala Ala Trp Val 
20 25 30 

Phe Ser Leu Val Leu Gly Leu Gly Gin Gly Glu Asp Asn Arg Cys Ala 
35 40 . 45 

Ser Ser Asn Ala Ala Ser Cys Ala Arg Cys Leu Ala Leu Gly Pro Glu 
5 ° 55 60 

Cys Gly Trp Cys Val Gin Glu Asp Phe lie Ser Gly Gly Ser Arg Ser 
65 70 75 80 

Glu Arg Cys Asp He Val Ser Asn Leu lie Ser Lys Gly Cys Ser Val 
85 90 95 

Asp Ser He Glu Tyr Pro Ser Val His Val lie lie Pro Thr Glu Asn 
100 105 no 

Glu lie Asn Thr Gin Val Thr Pro Gly Glu Val Ser lie Gin Leu Arg 
US 120 125 

Pro Gly Ala Glu Ala Asn Phe Met Leu Lys Val His Pro Leu Lys Lys 
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"0 135 140 

Tyr Pro Val Asp Leu Tyr Tyr Leu Val Asp val Ser Ala Ser Met His 
145 150 155 16Q 

Asn Asn He Glu Lys Leu Asn Ser Val Gly Asn Asp Leu Ser Arg Lys 
165 170 175 

Met Ala Phe Phe Ser Arg Asp Phe Arg Leu Gly Phe Gly Ser Tyr Val 
180 185 190 

Asp Lys Thr Val Ser Pro Tyr He Ser He His Pro Glu Arg He His 
I 95 200 205 

Asn Gin Cys Ser Asp Tyr Asn Leu Asp Cys Met Pro Pro His Gly Tyr 
210 215 220 

He His Val Leu Ser Leu Thr Glu Asn He Thr Glu Phe Glu Lys Ala 
225 230 235 240 

Val His Arg Gin Lys He Ser Gly Asn He Asp Thr Pro Glu Gly Gly 
245 250 255 

Phe Asp Ala Met Leu Gin Ala Ala Val Cys Glu Ser His He Gly Trp 
260 265 270 

Arg Lys Glu Ala Lys Arg Leu Leu Leu Val Met Thr Asp Gin Thr Ser 
275 280 285 

His Leu Ala Leu Asp Ser Lys Leu Ala Gly He Val Val Pro Asn Asp 
290 295 300 

Gly Asn Cys His Leu Lys Asn Asn Val Tyr Val Lys Ser Thr Thr Met 
305 310 315 320 

Glu His Pro Ser Leu Gly Gin Leu Ser Glu Lys Leu He Asp Asn Asn 
325 330 335 

He. Asn Val He Phe Ala Val Gin Gly Lys Gin Phe His Trp Tyr Lys 
340 345 350 

Asp Leu Leu Pro Leu Leu Pro Gly Thr He Ala Gly Glu He Glu Ser 
355 360 365 

Lys Ala Ala Asn Leu Asn Asn Leu Val Val Glu Ala Tyr Gin Lys Leu 
370 375 380 

He Ser Glu Val Lys Val Gin Val Glu Asn Gin Val Gin Gly He Tyr 
385 39 ° 395 400 

Phe Asn He Thr Ala He Cys Pro Asp Gly Ser Arg Lys Pro Gly Met 
405 410 415 

Glu Gly Cys Arg Asn Val Thr Ser Asn Asp Glu Val Leu Phe Asn Val 
420 425 430 

Thr Val Thr Met Lys Lys Cys Asp Val Thr Gly Gly Lys Asn Tyr Ala 
435 440 445 
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He He Lys Pro He Gly Phe Asn Glu Thr Ala Lys He His He His 

450 455 460 

Arg Asn Cys Ser Cys Gin Cys Glu Asp Asn Arg Gly Pro Lys Gly Lys 
465 470 475 480 

Cys Val Asp Glu Thr Phe Leu Asp Ser Lys Cys Phe Gin Cys Asp Glu 
485 490 495 

Asn Lys Cys His Phe Asp Glu Asp Gin Phe Ser Ser Glu Ser Cys Lys 
500 505 510 

Ser His Lys Asp Gin Pro Val Cys Ser Gly Arg Gly Val Cys Val Cys 
515 520 525 

Gly Lys Cys Ser Cys His Lys He Lys Leu Gly Lys Val Tyr Gly Lys 
530 535 540 

Tyr Cys Glu Lys Asp Asp Phe Ser Cys Pro Tyr His His Gly Asn Leu 
545 550 555 560 

Cys Ala Gly His Gly Glu Cys Glu Ala Gly Arg Cys Gin Cys Phe Ser 
565 570 575 

Gly Trp Glu Gly Asp Arg Cys Gin Cys Pro Ser Ala Ala Ala Gin His 
580 585 590 

Cys Val Asn Ser Lys Gly Gin Val Cys Ser Gly Arg Gly Thr Cys Val 
595 600 605 

Cys Gly Arg Cys Glu Cys Thr Asp Pro Arg Ser He Gly Arg Phe Cys 
610 615 620 

Glu His Cys Pro Thr Cys Tyr Thr Ala Cys Lys Glu Asn Trp Asn Cys 
625 630 6 35 640 

Met Gin Cys Leu His Pro His Asn Leu Ser Gin Ala He Leu Asp Gin 
645 650 655 

Cys Lys Thr Ser Cys Ala Leu Met Glu Gin Gin His Tyr Val Asp Gin 
660 665 670 

Thr Ser Glu Cys Phe Ser Ser Pro Ser Tyr Leu Arg He Phe Phe He 
675 680 685 

He Phe He Val Thr Phe Leu He Gly Leu Leu Lys Val Leu He He 
690 695 700 

Arg Gin Val He Leu Gin Trp Asn Ser Asn Lys He Lys Ser Ser Ser 
705 710 715 720 

Asp Tyr Arg Val Ser Ala Ser Lys Lys Asp Lys Leu He Leu Gin Ser 
725 730 735 

Val Cys Thr Arg Ala Val Thr Tyr Arg Arg Glu Lys Pro Glu Glu He 
740 745 750 
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Lys Met Asp He Ser Lys Leu Asn Ala His Glu Thr Phe Arg Cys Asn 
755 760 7 65 

Phe 



<210> 68 
<211> 550 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (29) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (460) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (462) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (477) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (492) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (501) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (513) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (523) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (534) 

<223> n equals a,t,g, or c 
<400> 68 



GGCACGAGTG 


GAAACTGCTA 


CTGCAAGGNT GGTTGGCATG GAGATAAATG TGAATTCCAG 


60 


TGCGATATCA 


CCCCCTGGGA 


AAGCAAGCGA AGATGCACGT CTCCAGATGG CAAAATCTGC 


120 


AATAACAGAG 


GGACTTGTGT 


ATGTGGTGAA TGTACCTGTC ACGATGTTGA TCCGACTGGG 


180 


GACTGGGGAG 


ATATTCATGG 


GGACACCTGT GAATGTGATG AGAGGGACTG TAfiAfiPTfiTr 




TATGACCGAT 


ATTCTGATGA 


CTTCTGTTCA GGTCATGGAC AGTGTAATTG CGGAAGATGT 


300 


GACTGCAAAG 


CAGGCTGGTT 


ATGGGAAGAA GTGTGAGCAC CCACAGTCCT GCACGCTGTC 


360 


AGCTGAGGAG 


AGCATCAGGA 


AGTGCCAAGG AAGCTCGGAT CTGCCTTGCT CTGGGAAGGG 


420 


TAAATGTGAA 


TGTGGCAAAT 


GCACNTGGTA TCCTCCAGGN GNTCCGCCGG GTGTATNGGC 


480 


AAACTTGTTG 


ANTTGTGATG 


NTCGCCGCTG TGNAAGACCC CGNGGTGTGG TCTNGGAAGC 


540 


CACGGCACAT 






550 



<210> 69 
<211> 495 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (56) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (91) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (258) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (302) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (321) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (350) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (355) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (363) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (369) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (378) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
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65 



PCT/US99/25031 



<220> 

<221> SITE 
<222> (387) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (388) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (395) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (401) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (468) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (494) 

<223> n equals a,t,g, or c 



<400> 69 

AATTCGGCAN AGGATTTTTT CTGGGCCATT AGAACATATA AATGCGGAGG AAACCNTGTA 60 

TATTCACCNC TAGGACAGGT TAAAAAGACC NTTGTATGTT TTTCTATTTC TGAATTACGA 120 

ATGAAATCCG AGTACCTATT AGAAATGAAG TTATGCAAAT TTAGATGCAA ATAACATTAG 180 

AAAAAAAAGA TTCTTCCATA ATTAACATAA GTGGTTCCTA ACGAGAGCAA TTTTTCCACC 240 

CAAAAGTCAT TTGGCAANAT CTACAGACCA TTTTTGATTG TCACACTGGG GTCGGGTAGG 300 

ANGTATGCTG CCAGACATTT NGGTGGGGTA GAGGGCCNGG GATGCTGCNN GGGCNTCCNC 360 

CNNTTGTTNC AGGCCGGNCC CCCCANNNAA GGGANTTTTT NCCCGGCCCC AAATGGCCCA 420 



TTNGGGGTTC AAACTTGGGG AACCCTTNGG GTTTTTTTGG GCTTTTTNGG AATTCCCCAT 



480 
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TTTTTTCCCC AGGNA 4g5 



<210> 70 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (126) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (182) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (192) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (350) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (370) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (377) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (380) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (399) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (412) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (414) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (441) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (447) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (459) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
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<222> (464) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (466) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (468) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (473) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (480) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (483) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (484) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (485) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (488) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (492) 

<223> n equals a,t,g, or c 
<400> 70 

AATTCGGCAN AGCCAACATC CGCGGAAGTG TAACATGACG GAAGAACAAA GCAAGAATCT 



GTGTGAATCA GCAGATGGCA TATTGTGCTC GGGGAAGGGT TCTTGTCATT GTGGGAAGTG 



120 
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CATTTNTTCT 


GCTGAGGAGT 


GGTATATTTC 


TGGGGAGTTC 


TGTGAACTGT 


GAATGACAGA 


180 


GNACTGCGAC 


ANACATGGAT GGTCTCATTT GTACAGGGAA 


TGGAATATGT 


AGCTGTGGGA 


240 


AACTGTGNAA 


TGCTGGGGAT 


GGATGGGAAT 


GGGAAATGCA 


TGTGAAAATC 


TGGGCTTGGG 


300 


CTCCAGAATA 


TCCCTTTAAC 


CATTTACATG 


AGGAGAGGGT 


CTTGGATTCN 


TAATTTTTTC 


360 


CTGGGGGCCN 


TTAGGGNCCN 


TTTAAATGNC 


GGGGGGAANC 


CTGTTNTNTT 


TNCNCCCTGG 


420 


GGGNCGGTTT 


AAAAAGCCCC 


NTGNNTNTTT 


TTCCCTTTNC 


GGANTNNNGG 


GTNAAAACCN 


480 


GGNNNCCNTT 


TNGAA 










495 



<210> 71 
<211> 495 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (36) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (38) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (53) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (68) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (81) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (82) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
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<222> (103) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (105) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (113) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (123) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (131) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (134) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (158) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (224) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (243) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (258) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (313) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (319) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (327) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 

<220> 
<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (358) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (365) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (370) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (371) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (374) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (377) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (394) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (456) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (460) 

<223> n equals a,t,g, or c 



<220> 



WO 00/29435 PCT/US99/25031 

73 

<221> SITE 
<222> (463) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (472) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 

<223> n equals a,t,g, or c 
<400> 71 



AATTCGGCAC 


AGGATTTTTT 


CTGGGCCATT 


AGAACNTNTA AATGCGAGGA AANCTTGTAT 


60 


ATTCACCNNT 


TGGACAGGTT 


NWAAAGACCG 


TTGTATGTTT TTNTNTTTNT GANTTACGGA 


120 


TGNAAATCCG 


NGTNACCTAT 


TAGAAATGGG 


TTATGCANAT TTAGATGCAA ATAACATTAG 


180 


AAAAAAAAGA 


TTCTTCCATA 


ATTAACATAA 


GTGGTTCCTA ACGNGAGCAA TTTTTCCNCC 


240 


CTNAAGTCAT 


TTGGCAANTC 


TACAGACNAT 


TTTGGTTGTC ACACTGGGTC GGGTAGGAAG 


300 


GTATGCTGGC 


AGNCATTTNG 


TGGGTANAGG 


GCCCTGGNNT GCTGTTGAAG CATCCCCNAG 


360 


TGTANCAGGN 


NCGNGCNCCA 


NACCANGGGG 


NTTNATCCCN GCCCCAANTG CCCATGGGGG 


420 


TTNAAACNTG 


GGGAAACGTT 


GGGGTNTTTT 


GGGCTNTTTN GGNAATTCCC CNTTTCTTTC 


480 


ACCAGGNAAG 


GCCCC 






495 



<210> 72 

<211> 317 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (120) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (121) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (185) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (267) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (288) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (307) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 



<400> 72 

GGCACAGTCA AATTTTATCG TATTTATTTA GGAATTTGTC TTCTCATTAC TCAGAAGAAC 60 

ATTTATATAT TCTTGATAGG AATCCTTTGG CAGGCCATGG TAAGTGTGAC TGTGGCAAGN 120 

NGCAAGTGTG ACCAGGGATG GTATGGGGAT GCTTGCCAGT ACCCAACTAA CTGTGACTTG 180 

ACAANGAAGA AAAGTAACCA AATGTGGCAA GAATTCACAA GACATCATCT GCTCTAATTC 240 

AGGTACATGT CACTGTGGCA GTGTAANTGT GATAGATTCA GATGGAANTG GGACNTGGTG 300 

TATGGGNNAA TTTGTGG -117 



<210> 73 

<211> 36 

<212> PRT 

<213> Homo sapiens 



<400> 73 

He Cys Gly Gly His Gly Lys Cys 
1 5 

Ala Gly Trp His Gly Asp Lys Cys 
20 

Trp Glu Ser Lys 
35 



Tyr Cys Gly Asn Cys Tyr Cys Lys 

10 15 

Glu Phe Gin Cys Asp He Thr Pro 

25 30 



<210> 74 

<211> 35 

<212> PRT 

<213> Homo sapiens 



WO 00/29435 PCT/US99/25031 

75 

<400> 74 

Gly Gin Pro Pro Gly Ala Ala Leu Cys His Gly Arg Gly Arg Cys Asp 
1 5 10 15 

Cys Gly Val Cys He Cys His Val Thr Glu Pro Gly Met Phe Phe Gly 
20 25 30 

Pro Leu Cys 
35 



<210> 75 

<211> 148 

<212> PRT 

<213> Homo sapiens 

<400> 75 

Met Ala Pro Gly Leu Arg Gly Leu Pro Arg Cys Gly Leu Trp Leu Leu 
15 10 15 

Leu Ala His His Leu Phe Met Val Thr Ala Cys Arg Asp Pro Asp Tyr 
20 25 30 

Gly Thr Leu He Gin Glu Leu Cys Leu Ser Arg Phe Lys Glu Asn Met 
35 40 45 

Glu Thr He Gly Lys Thr Leu Trp Cys Asp Trp Gly Lys Thr He Gin 
50 55 60 

Ser Tyr Gly Glu Leu Thr Tyr Cys Thr Lys His Val Ala His Thr He 
65 70 75 80 

Gly Cys Phe Trp Pro Asn Pro Glu Val Asp Arg Phe Phe He Ala Val 
85 90 95 

His His Arg Tyr Phe Ser Lys Cys Pro He Ser Gly Arg Ala Leu Arg 
100 105 no 

Asp Pro Pro Asn Ser He Leu Cys Pro Phe He Ala Leu Pro He Thr 
115 120 125 

Val Thr Leu Leu Met Thr Ala Leu Val Val Trp Arg Ser Lys Arg Thr 
130 135 140 

Glu Gly He Val 
145 



<210> 76 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 



WO 00/29435 
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<220> 

<221> SITE 
<222> (170) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (209) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (284) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (304) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (356) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (367) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (379) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (415) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (420) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (459) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (472) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (485) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (498) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (501) 

<223> n equals a,t,g, or c 



<400> 76 

AATTCGGCAN AGCTGGTCAT GGGCAGACCC CTCCCTTCCT GGGCTGACCT GCTCCCTCGA 
GGCCAGCCTG CTCCCTGGCT GAGGCTCAGG CTATCCGCCC AAGCTCTTTG CTCATTCTAG 
GGCCAGTGGA GGAAAATGTG ATAAGGCCAG AGCTTGTGTG CTGGGCAAGN AAATCACCTG 



WO 00/29435 PCT/US99/25031 







78 




CTGCATCCTG 


TGCTCCGCAG 


CTGGGCCGNA GCCTCTGCCC GCAGTTTCTA TGCTGTTTCT 


240 


TAGCACAGAA 


TCCAGCCTAG 


CCTTAGCCGC AGTCTAGGCC CTGNTTGGGA CTAGGACTCC 


300 


TTGNTTGACC 


CCATCTTTGG 


TTCCTGCCTG GTTCCTGNAA CAGCCCNAGT TCTGGNTAAA 


360 


TCCAGGNAGA AAGTTAGGNA AGGGTTTTTG GAAGAAGTTC CGTGNTTTGA ACTTNGGAGN 


420 


CTTNTTGGTG 


GGAAGAAAAT 


TGGAAAGNTT GGGNAGGANA AAGGNGGGTT TNGGGGTTTC 


480 


AAGGNGNATG 


GGGGGTTNAA 


NA 


502 



<210> 77 

<211> 505 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (11) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (14) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (67) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (208) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (247) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (250) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (267) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (299) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (320) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (321) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (384) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (417) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c / 
<220> 

<221> SITE 
<222> (440) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (474) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (484) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (488) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (490) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (497) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (504) 

<223> n equals a,t,g, or c 



<400> 77 



GAATTCGGCA NAGNCTGCGG CCGGCCAGCC 


ATGGAGACTG 


GAGCGCTGCG 


ACGCCGCAAC 


60 


TTCTCCNGTT 


GCTGCTGCTG CTCTGCGGTG GGTGTCCCAG AGCAGGCGGC 


TGCAACGAGA 


120 


CAGGCATGTT 


GGAGAGGCTG CCCCTGTGTG 


GGAAAGGCTT 


TCGCAGACAT 


GATGGGCAAG 


180 


GTGGACGTCT 


GGGAATGGTG CAACTGTNCC 


GAGTTCATCG 


TGTACTATGA 


GAAGTTTCAC 


240 


CAACTGNCAN 


CGAGATGGAG GCCAATNTCG 


TGGGCTGCTT 


ACTGGCCCAA 


CCCCCTGGNC 


300 


CCAGGGCTTT 


CATCACCGGN NTCCACAGGC 


AGTTTTTTNT 


TCCAACTGNA 


ACGTGGACAA 


360 


GGTNCATTTG 


GAGGGACCCC CCANANGAGG 


TTTTAATCCG 


TTGATNGTTA 


TTACCGNGGT 


420 
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TTTNATTTGG GCATGGTTGN CTGGTNGNTT TGGGGAAAAA GGAACGNAAA GTTNTTTTAG 480 
GGTNCCGNTN AATTGGNTTG GGTNA 595 



<210> 78 

<211> 101 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (101) 

<223> n equals a,t,g, or c 
<400> 78 

AATTCGGCAC GAGGTGAGGG GCCCTCTGGA ATGGCATCCC ATGAGCTTGT GGCCTCTATC 60 
TGCTACCATC TGTGTTTTAT CTGAGTAAAG TTACCTTACT N 101 



<210> 79 

<211> 322 

<212> PRT 

<213> Homo sapiens 

<400> 79 

Met Gly Met Leu Ala Arg Val Ala Leu Gly Leu He He He Asp Ala 
15 10 15 

Val Leu Ala Ala Pro Thr Thr Glu Leu Phe Asn Tyr Asp Ser Glu Val 
20 25 30 

Tyr Asp Ala He Leu Glu Asp Thr Gly Thr Phe Tyr Asn Tyr Glu His 
35 40 45 

He Pro Asp Asn His Val Glu Asn Glu Lys Val Ser Glu Arg Leu Ser 
50 55 60 

Gly Asn Arg Glu Leu Leu Thr Pro Gly Pro Gin Leu Gly Asp Asn Gin 
65 70 75 80 

Asp Glu Asp Lys Asp Glu Glu Ser Thr Pro Arg Leu He Asp Gly Ser 
85 90 95 

Ser Pro Gin Glu Pro Glu Phe Pro Gly Leu Leu Gly Pro His Thr Asn 
100 105 no 

Glu Asp Phe Pro Thr Cys Leu Leu Cys Thr Cys He Ser Thr Thr Val 
115 120 125 

Tyr Cys Asp Asp His Glu Leu Asp Ala He Pro Pro Leu Pro Lys Lys 
130 135 140 

Thr Thr Tyr Phe Tyr Ser Arg Phe Asn Arg He Lys Lys He Asn Lys 



WO 00/29435 PCT/US99/25031 
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I 45 150 155 160 

Asn Asp Phe Ala Ser Leu Asn Asp Leu Lys Arg He Asp Leu Thr Ser 
165 170 175 

Asn Leu He Ser Glu He Asp Glu Asp Ala Phe Arg Lys Leu Pro His 
180 185 190 

Leu Gin Glu Leu Val Leu Arg Asp Asn Lys He Lys Gin Leu Pro Glu 
195 200 205 

Leu Pro Asn Thr Leu Thr Phe He Asp lie Ser Asn Asn Arg Leu Gly 
210 215 220 

Arg Lys Gly He Lys Gin Glu Ala Phe Lys Asp Met Tyr Asp Leu His 
225 230 235 240 

His Leu Tyr He Thr Asp Asn Ser Leu Asp His He Pro Leu Pro Leu 
245 250 255 

Pro Glu Ser Leu Arg Ala Leu His Leu Gin Asn Asn Asp He Leu Glu 
260 265 270 

Met His Glu Asp Thr Phe Cys Asn Val Lys Asn Leu Thr Tyr Val Arg 
275 280 285 

Lys Ala Leu Glu Asp He Arg Leu Asp Gly Asn Pro He Asn Leu Ser 
290 295 300 

Arg Thr Pro Gin Ala Tyr Met Cys Leu Pro Arg Leu Pro He Gly Ser 
305 310 315 320 

Phe He 



<210> 80 

<211> 435 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (85) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 



<400> 80 

GGCAGAGCCA CCTTTCTGGA AGCTGCAGGG CTCTCCATCC AGGATCCAGA AGCATTGAAG 60 
GGGACCAGCC GCTGAAGGGA TTCTNAGTCC CATCTGACTC CCCATGAGGC TCCTGGCTTT 120 



CCTGAGTCTG CTGGCCTTGG TGCTGCAGGA GACAGGGACA GCTTCTCTCC CAAGGAAGGA 



180 
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GAGGAAGAGG 


AGAGAGGAGC 


AGATGCCCAG GGAAGGCGAT TCCTTTGAAG 


TTCTGCCTCT 


240 


GCGGAATNAT 


GTCCTGAACC 


CAGACAACTA TGGTGAAGTC ATTGACCTGA 


GCAACTATGA 


300 


GGAGCTCACA 


GATTATGGGG 


ACCAACTCCC CGAGGTTAAG GTGACTAGCC 


TCGCTCCTGC 


360 


AACCAGCATC 


AGTCCCGCCA 


AGAGCACTAC GGCTCCAGGG ACAACCTCGT 


CAAACCCCAC 


420 


GGATGACCCA 


GACCT 






435 



<210> 81 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (98) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (433) 

<223> n equals a,t,g, or c 



<400> 81 
GACAGCCTCC 


ACCAGAGTCC 


CCACCTTTTT GGAAGCTGCA 


GGGCTCTCCA 


TCCAGGNTCC 


60 


AGAAGCATTG 


AAGGGGACCA 


GCCGCTGAAG GGATTCTNAG 


TCCCATCTGA 


CTCCCCATGA 


120 


GGCTCCTGGC 


TTTCCTGAGT 


CTGCTGGCCT TGGTGCTGCA 


GGAGACAGGG 


ACAGCTTCTT 


180 


TCCCAAGGAA 


GGAGAGGAAG AGGAGAGAGG AGCAGATGCC 


CAGGGAAGGC 


GATTCCTTTG 


240 


AAGTTCTGCC 


TCTGCGGAAT 


GATGTCCTGA ACCCAGACAA CTATGGTGAA GTCATTGACC 


300 


TGAGCAACTA 


TGAGGAGCTC 


ACAGATTATG GGGACCAACT 


CCCCGAGGTT 


AAGGTGACTA 


360 


GCCTCGCTCC 


TGCAACCAGC 


ATCAGTCCCG NCAAGAGCAC 


TACGGGCTTC 


AGGGGACAAC 


420 


CTCGTCAAAC 


CCNACGGTGA 


CCAGACCTA 






449 



<210> 82 
<211> 331 



WO 00/29435 PCT/US99/25031 

84 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (29) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (70) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<400> 82 



CTGGAAGCTG 


CAGGGCTCTC 


CATCCAGGNT CCAGAAGCAT TGAAGGGGAC CAGCCGCTGA 


60 


AGGGATTCTN 


AGTCCCATCT 


GACTCCCCAT GAGGCTCCTG GCTTTCCTGA GTCTGCTGGC 


120 


CTTGGTGCTG 


CAGGAGACAG 


GGACAGCTTC TNTCCCAAGG AAGGAGAGGA AGAGGAGAGA 


180 


GGAGCAGATG 


CCCAGGGAAG 


GCGATTCCTT TGAAGTTCTG CCTCTGCGGA ATGATGTCCT 


240 


GAACCCAGAC 


AACTATGGTG AAGTCATTGA CCTGAGCAAC TATGAGGAGC TCACAGATTA 


300 


TGGGGACCAA 


CTCCCCGAGG 


TTAAGGTGAC T 


331 



<210> 83 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (115) 

<223> n equals a,t,g, or c 
<400> 83 

CTGGCTTTCC TGAGTCTGNT GGCCTTGGTG CTGCAGGAGA CAGGGACAGC TTCTTTTNCA 



AGGAAGGAGA GGAAGAGGAG AGAGGAGCAG ATGCCCAGGG AAGGCGATTC CTTTNAAGTT 



120 



WO 00/29435 PCT/US99/25031 
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CTGCCTCTGC GGAATGATGT CCTGAACCCA GACAACTATG GTGAAGTCAT TGACCTGAGC 11 
AACTATGAGG AGCTCACAGA TTATGGGGAC CAACTCCCCG AGGTTAAGGT GACTAGCCTC 2^ 
GCTCCTGCAA CCAGCATCAG TCCCGCCAAG AGCACTACGG 21 



<210> 84 

<211> 211 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (93) 

<223> n equals a,t,g, or c 



<400> 84 
CTTGGACCAG 


CGGGCATCAC ATTCTCCAGC 


AGCCGCCATC 


TCACACGCCT 


CCCTCCTGTG 


60 


GCCGCCGGCA 


GCATGGACAA AGGTCTCCAT 


GCNGGGGGAG 


GAGGCCTGCT 


TCTTTCCCCA 


120 


CAGCTCTCAC 


GTCTCCCTTC TCCCTGCGGG 


TGACAAAGAA 


GCCCAAGGAC 


CACCTCCTTC 


180 


CTGCCTCATT 


GTAATAAAAT TCCCCACACT 


G 






211 



<210> 85 

<211> 208 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (56) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (117) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (140) 

<223> n equals a,t,g, or c 



<400> 85 

CTGGATGGCA ACCCCATCAA CCTCAGCCTC 
CTCCCCATCG GCCGCTTCAC GTAGCTCGGA 
GACCAGCGGG CATCACATTN TCCAGCAGCC 



TTCCCCAGCG CCTACTTCTG CCTGCNTCGG 60 
GCCCTTCCAC TCCTCCCAGG TCATCTNTTG 120 
GCCATCTCAC ACGCCTCCCT CCTGTGGCCG 180 



CCGGCAGCAT GGACAAAGGT CTCCATGC 



208 



WO 00/29435 



86 



PCT/US99/25031 



<210> 86 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 86 

CGACCCACGC GTCCGCCGCC TTCGGCTTCC CCTTCTGCCA AGAGCCCTGA GCCACTCACA 60 
GCACGACCAG AGA 73 



<210> 87 

<211> 305 

<212> DNA " 

<213> Homo sapiens 

<400> 87 

GTATGGAATG GGGTGGGAAC CCCTGCCTCT CACACTGGGG AGGGACCCTG GGGACAGCCT 60 

ATGGGCTGAG CAGAGAGGGC TCTCAGGGAC CCCTGCAGCA CAAGAATCTC CCACCACGGT 120 

CTCTGTCCCA GCCCTGACTC AGAAGCCTGA TGTCTACATC CCCGAGACCC TGGAGCCCGG 180 

GCAGCCGGTG ACGGTCATCT GTGTGTTTAA CTGGGCCTTT GAGGAATGTC CACCCCCTTC 240 

TTTCTCCTGG ACGGGGGCTG CCCTCTCCTC CCAAGGAACC AAACCAACGA CCTCCCACTT 300 

CTCAG 305 

<210> 88 

<211> 175 

<212> DNA 

<213> Homo sapiens 

<400> 88 

ATCCTCCAGA GAACCTGAGA GTGATGGTTT CCCAAGCAAA CAGGACAGGT AGGAAAGGGG 60 

ACAGAGGAGC CAAGGCCTCT CAGTGCCGAA TTGGGGGCCC AGGAGTCTGG AGGGTCCCCA 120 

CGCAGGAGGG TCCCTGAGCC CTGAGCTGCT CATCGATTCT GCCTCTTCCT TCCCT 175 



<210> 89 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 89 

GTGAGTGGGG GAAAGGGGAC ACCTGGGTCC CAGGAAGGGG ACCCTGCTGA GTCCTGTCCT 60 



CCCTCCCCTC AG 



72 



WO 00/29435 PCT/US99/25031 
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<210> 90 

<211> 421 

<212> DNA 

<213> Homo sapiens 

<400> 90 

CTGGCCCCCT GGCTCAGAAG CGGAATCAGA AAGCCACACC AAACAGTCCT CGGACCCCTC 60 
TTCCACCAGG TGCTCCCTCC CCAGAATCAA AGAAGAACCA GAAAAAGCAG TATCAGTTGC 120 
CCAGTTTCCC AGAACCCAAA TCATCCACTC AAGCCCCAGA ATCCCAGGAG AGCCAAGAGG 180 
AGCTCCATTA TGCCACGCTC AACTTCCCAG GCGTCAGACC CAGGCCTGAG GCCCGGATGC 240 
CCAAGGGCAC CCAGGCGGAT TATGCAGAAG TCAAGTTCCA ATGAGGGTCT CTTAGGCTTT 300 
AGGACTGGGA CTTCGGCTAG GGAGGAAGGT AGAGTAAGAG GTTGAAGATA ACAGAGTGCA 3 60 

AAGTTTCCTT CTCTCCCTCT CTCTCTCTCT TTCTCTCTCT CTCTCTCTTT CTCTCTCTTT 420 
T 

421 

<210> 91 

<211> 964 

<212> DNA 

<213> Homo sapiens 

<400> 91 

AAAAAAACAT CTGGCCAGGG CACAGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGT 60 

TGAGGTGGGC AGATCGCCTG AGGTCGGGAG TTCGAGACCA GCCTGGCCAA CTTGGTGAAA 120 

CCCCGTCTCT ACTAAAAATA CAAAAATTAG CTGGGCATGG TGGCAGGCGC CTGTAATCCT 180 

ACTACTTGGG AAGCTGAGGC AGGAGAATCA CTTGAACCTG GGAGACGGAG GTTGCAGTGA 240 

GCCAAGATCA CACCATTGCA CGCCAGCTTG GGCAACAAAG CGAGACTCCA TCTCAAAAAA 300 

AAAATCCTCC AAATGGGTTG GGTGTCTGTA ATCCCAGCAC TTTGGGAGGC TAAGGTGGGT 360 

GGATTGCTTG AGCCCAGGAG TTCGAGACCA GCCTGGGCAA CATGGTGAAA CCCCATCTCT 420 

ACAAAAAATA CAAAACATAG CTGGGCTTGG TGGTGTGTGC CTGTAGTCCC AGCTGTCAGA 480 

CATTTAAACC AGAGCAACTC CCATCTGGAA TGGGAGCTGA ATAAAATGAG GCTGAGACCT 540 

ACTGGGCTGC CATTCTCAGA CAGTGGAGGC CATTCTAAGT CACAGGATGA GACAGGAGGT 600 

CCGTACAAGA TACAGGTCAT AAAGACTTTG CTGATAAAAC AGATTGCAGT AAAGAAGCCA 660 

ACCAAATCCC ACCAAAACCA AGTTGGCCAC GAGAGTGACC TCTGGTCGTC CTCACTGCTA 720 

CACTCCTGAC AGCACCATGA CAGTTTACAA ATGCCATGGC AACATCAGGA AGTTACCCGA 780 



WO 00/29435 PCT/US99/25031 
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TATGTCCCAA AAGGGGGAGG AATGAATAAT CCACCCCTTG TTTAGCAAAT AAGCAAGAAA 840 
TAACCATAAA AGTGGGCAAC CAGCAGCTCT AGGCGCTGCT CTTGTCTATG GAGTAGCCAT 900 
TCTTTTGTTC CTTTACTTTC TTAATAAACT TGCTTTCACC TTAAAAAAAA AAAAAAAAAA 960 
AAAA 964 



<210> 92 
<211> 364 
<212> PRT 

<213> Homo sapiens 
<400> 92 

Met Pro Leu Leu Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala 
15 10 15 

Met Asp Pro Asn Phe Trp Leu Gin Val Gin Glu Ser Val Thr Val Gin 
20 25 30 

Glu Gly Leu Cys Val Leu Val Pro Cys Thr Phe Phe His Pro lie Pro 
35 40 45 

Tyr Tyr Asp Lys Asn Ser Pro Val His Gly Tyr Trp Phe Arg Glu Gly 
50 55 60 

Ala lie lie Ser Gly Asp Ser Pro Val Ala Thr Asn Lys Leu Asp Gin 
65 70 75 80 

Glu Val Gin Glu Glu Thr Gin Gly Arg Phe Arg Leu Leu Gly Asp Pro 
85 90 95 

Ser Arg Asn Asn Cys Ser Leu Ser He Val Asp Ala Arg Arg Arg Asp 
100 105 HO 

Asn Gly Ser Tyr Phe Phe Arg Met Glu Arg Gly Ser Thr Lys Tyr Ser 
H5 120 125 

Tyr Lys Ser Pro Gin Leu Ser Val His Val Thr Asp Leu Thr His Arg 
130 135 140 

Pro Lys He Leu He Pro Gly Thr Leu Glu Pro Gly His Ser Lys Asn 
145 150 155 160 

Leu Thr Cys Ser Val Ser Trp Ala Cys Glu Gin Gly Thr Pro Pro He 
165 170 175 

Phe Ser Trp Leu Ser Ala Ala Pro Thr Ser Leu Gly Pro Arg Thr Thr 
180 185 190 

His Ser Ser Val Leu He lie Thr Pro Arg Pro Gin Asp His Gly Thr 
195 200 205 

Asn Leu Thr Cys Gin Val Lys Phe Ala Gly Ala Gly Val Thr Thr Glu 
210 215 220 
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Arg Thr He Gin Leu Asn Val Thr Tyr Val Pro Gin Asn Pro Thr Thr 

225 230 235 240 

Gly He Phe Pro Gly Asp Gly Ser Gly Lys Gin Glu Thr Arg Ala Gly 
245 250 255 

Val Val His Gly Ala He Gly Gly Ala Gly Val Thr Ala Leu Leu Ala 
260 265 270 

Leu Cys Leu Cys Leu He Phe Phe lie Val Lys Thr His Arg Arg Lys 
275 280 285 

Ala Ala Arg Thr Ala Val Gly Arg Asn Asp Thr His Pro Thr Thr Gly 
290 295 300 

Ser Ala Ser Pro Lys His Gin Lys Lys Ser Lys Leu His Gly Pro Thr 
305 310 315 320 

Glu Thr Ser Ser Cys Ser Gly Ala Ala Pro Thr Val Glu Met Asp Glu 
325 330 335 

Glu Leu His Tyr Ala Ser Leu Asn Phe His Gly Met Asn Pro Ser Lys 
340 345 350 

Asp Thr Ser Thr Glu Tyr Ser Glu Val Arg Thr Gin 
355 360 



<210> 93 

<211> 397 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (68) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (148) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (262) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (323) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (361) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (388) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (390) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (392) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (394) 

<223> n equals a,t,g, or c 



<400> 93 



GGCAGAGTGG 


CCCCCTGGNT 


CAGAAGCGGA ATCAGAAAGC CACACCAAAC 


AGTCCTCGGA 


60 


CCCCTCTNCC 


ACCAGGTGCT 


CCCTCCCCAG AATCAAAGAA GAACCAGAAA 


AAGCAGTATC 


120 


AGTTGCCCAG 


TTTCCCAGAA 


CCCAAATNAT CCACTCAAGC CCCAGAATCC 


CAGGAGAGCC 


180 


AAGAGGAGCT 


CCATTATGCC 


ACGCTCAACT TCCCAGGCGT CAGACCCAGG 


CCTGAGGCCG 


240 


GTATGCCCAA 


GGGCACCCAG 


GNGGATTATG CAGAAGTCAA GTTCCAATGA 


GGGTCTCTTA 


300 


GGCTTTAGGA 


CTGGGACTTC 


GGNTAGGGAG GAAGGTAGAG TAAGAGGTTG AAGTTAACAG 


360 


NTGCAAATTT 


CCTTTTTTTC CTTTTNTNTN TNTNTTT 




397 



<210> 94 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (190) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (241) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (262) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (264) 

<223> n equals a,t,g, or c 
<220> 

<:221> SITE 
<222> (265) 

<223> n equals a,t,g, or c 
<400> 94 

GCTCCTACCC CCGACAAGAC TGGACAGGGT CTACCCCAGC TTATGGCTAC TGGTTCAAAG 
CAGTGACTGA GACAACCAAG GGTGCTCCTG TGGCCACAAA CCACCAGAGT CGAGAGGTGG 
AAATGAGCAC CCGGGGCCGA TTCCAGCTCA CTGGGGATCC CGCCAAGGGG AACTGCTCCT 
TGGTGATCAN AAGACGCGCA GATGCAGGAT GAGTCACAGT ACTTCTTTCG GGTGGAGAGA 
NGAACTATGT GAGATATAAT GNCNNGAACG ATGGGT 



<210> 95 

<211> 469 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (375) 

<223> n equals a,t f g, or c 
<220> 

<221> SITE 
<222> (430) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (457) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<400> 95 

GCAACGACCT CCCACTTCTC AGTGCTCAGC TTCACGCGCA GACCCCAGGA CCACAACACC 60 
GACCTCACCT GCCATGTGGA CTTCTCCAGA AAGGGTGTGA GCGTACAGAG GACCGTCCGA 120 
CTCCGTGTGG CCTATGCCCC CAGAGACCTT GTTATCAGCA TTTCACGTGA CAACACGCCA 180 
GATCCTCCAG AGAACCTGAG AGTGATGGTT TCCCAAGCAA ACAGGACAGT CCTGGAAAAC . 240 
CTTGGGAACG GCACGTCTCT CCCAGTACTG GAGGGCCAAA GCCTGTGCCT GGTCTGTGTC 300 
ACACACAGCA GCCCCCCAGC CAGGCTGAGC TGGACCCAGA AGGGACANGT TCTGAGCCCC 360 
TCCCAGCCCT CAGANCCCGG GGTCCTGGAG TGCCTTCGGG TTCAAGTGGA GCAACGAAAG 420 
GAGAGTTCAN CTGGCAAGGT TCGGNAACCA ATTGGGNTTC CAAGAAGGN 469 



<210> 96 

<211> 398 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (160) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (189) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (221) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (285) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (326) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (344) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (392) 

<223> n equals a,t,g, or c 
<400> 96 

ACGGAGCGTT TCTGGGAATC GGCATCACGG CTCTTCTTTT CCTCTGCCTG GCCCTGATCA 60 
TCATGAAGAT TCTACCGAAG AGACGGACTC AGACAGAAAC CCCGTAGGCC CAGGTTCTCC 120 
CGGCACAGCA CGATCCTGGA TTACATCAAT GTGGTCCCGN ACGGACTGGC CCCCTGGCTC 180 
AGAAGCGGNA ATCAGTAAAG CCACACCAAA CAGTCCTCGG NACCCCTCTT GCCACCAGGT 240 
GCTCCCTNCC CCAGGAATGC AAAGAAGAAC CAGAAAAAGG CAGTNATGCA GTTGCCCAGT 300 
TTGCCCAGAA CCCAAATCAT TCCATNCAAG CCCCAGAANC CCANGAGAGC CAAGAGGAGT 360 
TCCATTAAGG CCAGGTTCAA ANTTTCCCAG GNGTTCGA 398 



<210> 97 

<211> 256 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (40) 

<223> n equals a,t,g, or c 
<220> 
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<221> SITE 
<222> (52) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (68) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (73) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (85) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (91) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (126) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (127) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (162) 

<223> n equals a,t,g f or c 
<220> 

<221> SITE 
<222> (164) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (184) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
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<400> 97 


CATCC GCNGC 


CTTCAGCTTC 


CCCTTCTGCN 


AAGAGCCCTG ANCCACTCAC 


60 


AGCACGANCA 


GANAACAGGC 


CTGTNTCAAG 


NAGGCCCTGC 


GCCTCCTATG 


CGGAGATGCT 


120 


ACTGCNNCTG 


CTGCTGTCCT 


CGCTGCTGGG 


CGGGTCCCAG 


GNTNTGGATG 


GGAGATTCTG 


180 


GATNCGAGTG 


CAGGAGTCAG 


TGATGGTGCC 


GGAAGGCCTG 


TGCATCTCTG 


TGCCCTGCTC 


240 


TTTCTCCTAC 


NCCCGN 










256 



<210> 98 

<211> 299 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (239) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (264) 

<223> n equals a,t,g, or c 
<400> 98 

CCAACAGCTC CCTGAGCCTC CATGGAGGGC TCAGCTCCGG CCTCAGGCTC CGCTGTAAGG 60 
CCTGGAACGT CCACGGGGCC CAGAGTGGCT CTGTCTTCCA GCTGCTACCA GGGTGAAGAT 120 
CTGCAGGAAG GAAGCTCGCA AGAGGGCAGC AGCTGAGCAG GACGTGCCCT CCACCCTGGG 180 
ACCCATCTCC CAGGGTCACC AGCATGAATG CTCGGCAGGC AGCTCCCAAG ACCACCCGNC 240 
CCCAGGTGCA GCCACCTACA CCCNGGGGAA GGGGGAAGAG CAGGAGCTCC ACTATGCCT 299 



<210> 99 

<211> 442 

<212> PRT 

<213> Homo sapiens 

<400> 99 

Met Leu Pro Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala Gin 
1 5 10 15 

Glu Arg Arg Phe Gin Leu Glu Gly Pro Glu Ser Leu Thr Val Gin Glu 
20 25 30 

Gly Leu Cys Val Leu Val Pro Cys Arg Leu Pro Thr Thr Leu Pro Ala 
35 40 45 



Ser Tyr Tyr Gly Tyr Gly Tyr Trp Phe Leu Glu Gly Ala Asp Val Pro 
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50 55 60 

Val Ala Thr Asn Asp Pro Asp Glu Glu Val Gin Glu Glu Thr Arg Gly 
65 70 75 80 

Arg Phe His Leu Leu Trp Asp Pro Arg Arg Lys Asn Cys Ser Leu Ser 
85 90 95 

He Arg Asp Ala Arg Arg Arg Asp Asn Ala Ala Tyr Phe Phe Arg Leu 
100 105 no 

Lys Ser Lys Trp Met Lys Tyr Gly Tyr Thr Ser Ser Lys Leu Ser Val 
115 120 125 

Arg Val Met Ala Leu Thr His Arg Pro Asn He Ser He Pro Gly Thr 
130 135 140 

Leu Glu Ser Gly His Pro Ser Asn Leu Thr Cys Ser Val Pro Trp Val 
i45 150 155 160 

Cys Glu Gin Gly Thr Pro Pro He Phe Ser Trp Met Ser Ala Ala Pro 
165 170 175 

Thr Ser Leu Gly Pro Arg Thr Thr Gin Ser Ser Val Leu Thr He Thr 
180 185 190 

Pro Arg Pro Gin Asp His Ser Thr Asn Leu Thr Cys Gin Val Thr Phe 
195 200 205 

Pro Gly Ala Gly Val Thr Met Glu Arg Thr He Gin Leu Asn Val Ser 
210 , 215 220 

Tyr Ala Pro Gin Lys Val Ala He Ser He Phe Gin Gly Asn Ser Ala 
225 230 235 240 

Ala Phe Lys He Leu Gin Asn Thr Ser Ser Leu Pro Val Leu Glu Gly 
245 250 255 

Gin Ala Leu Arg Leu Leu Cys Asp Ala Asp Gly Asn Pro Pro Ala His 
260 265 270 

Leu Ser Trp Phe Gin Gly Phe Pro Ala Leu Asn Ala Thr Pro He Ser 
275 280 285 

Asn Thr Gly Val Leu Glu Leu Pro Gin Val Gly Ser Ala Glu Glu Gly 
290 295 300 

Asp Phe Thr Cys Arg Ala Gin His Pro Leu Gly Ser Leu Gin He Ser 
305 310 315 320 

Leu Ser Leu Phe Val His Trp Lys Pro Glu Gly Arg Ala Gly Gly Val 
325 330 335 

Leu Gly Ala Val Trp Gly Ala Ser He Thr Thr Leu Val Phe Leu Cys 
340 345 3 50 

Val Cys Phe lie Phe Arg Val Lys Thr Arg Arg Lys Lys Ala Ala Gin 
355 360 365 
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Pro Val Gin Asn Thr Asp Asp Val Asn Pro Val Met Val Ser Gly Ser 
37 0 375 380 

Arg Gly His Gin His Gin Phe Gin Thr Gly He Val Ser Asp His Pro 
385 390 395 400 

Ala Glu Ala Gly Pro He Ser Glu Asp Glu Gin Glu Leu His Tyr Ala 
405 410 415 

Val Leu His Phe His Lys Val Gin Pro Gin Glu Pro Lys Val Thr Asp 
420 425 430 

Thr Glu Tyr Ser Glu He Lys He His Lys 
435 440 



<210> 100 

<211> 258 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (111) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (193) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (196) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (230) 

<223> n equals a,t,g, or c 



<400> 100 

AAACTCGGGA CCGATTCCAC CTCCTTGGGG ACCCACAGAC CAAAAATTGC ACCCTGAGCA 
TCAGAGATCC AGAATGAGTG ATGCGGGGAG ATACTTCTTT CGTATGGAGA NAGGAAATAT 
AAAATGGAAT TATAAATATG ACCAGCTCTC TGTGAACGTG ACAGACCCTC CTCAGAACTT 
GACTGTGACT GTNTTNCAAG GAGAAGGCAC AGCATCCACA GCTCTGGGGN ACAGCTCATC 
TCTTTCAGTC CTAGAGGG 



<210> 101 
<211> 139 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (21) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (26) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (34) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (37) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (92) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (110) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (123) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (131) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (133) 

<223> n equals a,t,g, or c 
<400> 101 

CTCGCTGACG ATGCAGAGTT NCGTGNCCGT GCANGANGGC ATGTGTGTCC ATGTGCGCTG 60 
CTCCTTCTCC TACCCAGTGG ACGCCAGACT GNATCTGCAC ACGTTCTGGN TATGGTTCGG 120 
GCNGGAATGA NTNAGCTGG nq 



<210> 102 
<211> 33 
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<212> PRT 

<213> Homo sapiens 
<400> 102 

Thr Asn Gly Tyr Gin Lys Thr Gly Asp Val Tyr Lys Cys Pro Val lie 
1 5 io 15 

His Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn 
20 25 30 

Val 



<210> 103 

<211> 1151 

<212> PRT 

<213> Homo sapiens 

<400> 103 

Phe Asn Val Asp Val Lys Asn Ser Met Thr Phe Ser Gly Pro Val Glu 
15 10 15 

Asp Met Phe Gly Tyr Thr Val Gin Gin Tyr Glu Asn Glu Glu Gly Lys 
20 25 30 

Trp Val Leu He Gly Ser Pro Leu Val Gly Gin Pro Lys Asn Arg Thr 
35 40 45 

Gly Asp Val Tyr Lys Cys Pro Val Gly Arg Gly Glu Ser Leu Pro Cys 
50 55 60 

Val Lys Leu Asp Leu Pro Val Asn Thr Ser He Pro Asn Val Thr Glu 
65 7 0 75 80 

Val Lys Glu Asn Met Thr Phe Gly Ser Thr Leu Val Thr Asn Pro Asn 
85 90 95 

Gly Gly Phe Leu Ala Cys Gly Pro Leu Tyr Ala Tyr Arg Cys Gly His 
100 105 HO 

Leu His Tyr Thr Thr Gly He Cys Ser Asp Val Ser Pro Thr Phe Gin 
115 120 125 

Val Val Asn Ser He Ala Pro Val Gin Glu Cys Ser Thr Gin Leu Asp 
130 135 140 

He Val He Val Leu Asp Gly Ser Asn Ser He Tyr Pro Trp Asp Ser 
145 150 155 160 

Val Thr Ala Phe Leu Asn Asp Leu Leu Lys Arg Met Asp He Gly Pro 
165 170 175 

Lys Gin Thr Gin Val Gly He Val Gin Tyr Gly Glu Asn Val Thr His 
180 185 190 

Glu Phe Asn Leu Asn Lys Tyr Ser Ser Thr Glu Glu Val Leu Val Ala 
i95 200 205 
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Ala Lys Lys He Val Gin Arg Gly Gly Arg Gin Thr Met Thr Ala Leu 
210 215 

Gly Thr Asp Thr Ala Arg Lys Glu Ala Phe Thr Glu Ala Arg Gly Ala 
225 230 235 240 

Arg Arg Gly Val Lys Lys Val Met Val He Val Thr Asp Gly Glu Ser 
245 250 255 

His Asp Asn His Arg Leu Lys Lys Val He Gin Asp Cys Glu Asp Glu 
260 265 270 

Asn He Gin Arg Phe Ser He Ala He Leu Gly Ser Tyr Asn Arg Gly 
275 280 285 

Asn Leu Ser Thr Glu Lys Phe Val Glu Glu He Lys Ser He Ala Ser 
290 295 300 

Glu Pro Thr Glu Lys His Phe Phe Asn Val Ser Asp Glu Leu Ala Leu 
305 31 ° 315 320 

Val Thr lie Val Lys Thr Leu Gly Glu Arg He Phe Ala Leu Glu Ala 
325 330 335 

Thr Ala Asp Gin Ser Ala Ala Ser Phe Glu Met Glu Met Ser Gin Thr 
340 345 350 

Gly Phe Ser Ala His Tyr Ser Gin Asp Trp Val Met Leu Gly Ala Val 
355 360 365 

Gly Ala Tyr Asp Trp Asn Gly Thr Val Val Met Gin Lys Ala Ser Gin 
370 375 380 

He He He Pro Arg Asn Thr Thr Phe Asn Val Glu Ser Thr Lys Lys 
385 3 50 395 400 

Asn Glu Pro Leu Ala Ser Tyr Leu Gly Tyr Thr Val Asn Ser Ala Thr 
405 410 415 

Ala Ser Ser Gly Asp Val Leu Tyr He Ala Gly Gin Pro Arg Tyr Asn 
420 425 430 

His Thr Gly Gin Val He He Tyr Arg Met Glu Asp Gly Asn He Lys 
435 440 445 

He Leu Gin Thr Leu Ser Gly Glu Gin He Gly Ser Tyr Phe Gly Ser 
450 455 460 

He Leu Thr Thr Thr Asp He Asp Lys Asp Ser Asn Thr Asp He Leu 
465 47 ° 475 480 

Leu Val Gly Ala Pro Met Tyr Met Gly Thr Glu Lys Glu Glu Gin Gly 
485 4 90 495 

Lys Val Tyr Val Tyr Ala Leu Asn Gin Thr Arg Phe Glu Tyr Gin Met 
500 505 5io 
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Ser Leu Glu Pro He Lys Gin Thr Cys Cys Ser Ser Arg Gin His Asn 
515 520 525 

Ser Cys Thr Thr Glu Asn Lys Asn Glu Pro Cys Gly Ala Arg Phe Gly 
530 535 540 

Thr Ala He Ala Ala Val Lys Asp Leu Asn Leu Asp Gly Phe Asn Asp 
545 550 555 560 

He Val He Gly Ala Pro Leu Glu Asp Asp His Gly Gly Ala Val Tyr 
565 570 575 

He Tyr His Gly Ser Gly Lys Thr He Arg Lys Glu Tyr Ala Gin Arg 
580 585 590 

lie Pro Ser Gly Gly Asp Gly Lys Thr Leu Lys Phe Phe Gly Gin Ser 
595 600 605 

He His Gly Glu Met Asp Leu Asn Gly Asp Gly Leu Thr Asp Val Thr 
610 615 620 

He Gly Gly Leu Gly Gly Ala Ala Leu Phe Trp Ser Arg Asp Val Ala 
625 630 635 640 

Val Val Lys Val Thr Met Asn Phe Glu Pro Asn Lys Val Asn He Gin 
645 650 655 

Lys Lys Asn Cys His Met Glu Gly Lys Glu Thr Val Cys He Asn Ala 
660 665 670 

Thr Val Cys Phe Glu Val Lys Leu Lys Ser Lys Glu Asp Thr He Tyr 
675 680 685 

Glu Ala Asp Leu Gin Tyr Arg Val Thr Leu Asp Ser Leu Arg Gin He 
690 695 700 

Ser Arg Ser Phe Phe Ser Gly Thr Gin Glu Arg Lys Val Gin Arg Asn 
7 °5 710 715 720 

He Thr Val Arg Lys Ser Glu Cys Thr Lys His Ser Phe Tyr Met Leu 
725 730 735 

Asp Lys His Asp Phe Gin Asp Ser Val Arg He Thr Leu Asp Phe Asn 
740 745 750 

Leu Thr Asp Pro Glu Asn Gly Pro Val Leu Asp Asp Ser Leu Pro Asn 
755 760 765 

Ser Val His Glu Tyr He Pro Phe Ala Lys Asp Cys Gly Asn Lys Glu 
770 775 780 

Lys Cys He Ser Asp Leu Ser Leu His Val Ala Thr Thr Glu Lys Asp 
785 790 795 800 

Leu Leu He Val Arg Ser Gin Asn Asp Lys Phe Asn Val Ser Leu Thr 
805 810 815 

Val Lys Asn Thr Lys Asp Ser Ala Tyr Asn Thr Arg Thr He Val His 
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820 825 830 

Tyr Ser Pro Asn Leu Val Phe Ser Gly lie Glu Ala lie Gin Lys Asp 
835 840 845 

Ser Cys Glu Ser Asn His Asn lie Thr Cys Lys Val Gly Tyr Pro Phe 
850 855 860 

Leu Arg Arg Gly Glu Met Val Thr Phe Lys lie Leu Phe Gin Phe Asn 
865 870 875 880 

Thr Ser Tyr Leu Met Glu Asn Val Thr lie Tyr Leu Ser Ala Thr Ser 
885 890 895 

Asp Ser Glu Glu Pro Pro Glu Thr Leu Ser Asp Asn Val Val Asn He 
900 905 910 

Ser He Pro Val Lys Tyr Glu Val Gly Leu Gin Phe Tyr Ser Ser Ala 
915 920 925 

Ser Glu Tyr His He Ser He Ala Ala Asn Glu Thr Val Pro Glu Val 
930 935 940 

He Asn Ser Thr Glu Asp He Gly Asn Glu He Asn He Phe Tyr Leu 
945 950 955 960 

He Arg Lys Ser Gly Ser Phe Pro Met Pro Glu Leu Lys Leu Ser He 
965 970 975 

Ser Phe Pro Asn Met Thr Ser Asn Gly Tyr Pro Val Leu Tyr Pro Thr 
980 985 990 

Gly Leu Ser Ser Ser Glu Asn Ala Asn Cys Arg Pro His He Phe Glu 
995 1000 1005 

Asp Pro Phe Ser He Asn Ser Gly Lys Lys Met Thr Thr Ser Thr Asp 
1010 1015 1020 

His Leu Lys Arg Gly Thr He Leu Asp Cys Asn Thr Cys Lys Phe, Ala 
1030 1035 1040 

Thr He Thr Cys Asn Leu Thr Ser Ser Asp He Ser Gin Val Asn Val 
1045 1050 1055 

Ser Leu He Leu Trp Lys Pro Thr Phe He Lys Ser Tyr Phe Ser Ser 
1060 1065 1070 

Leu Asn Leu Thr He Arg Gly Glu Leu Arg Ser Glu Asn Ala Ser Leu 
1075 1080 1085 

Val Leu Ser Ser Ser Asn Gin Lys Arg Glu Leu Ala He Gin He Ser 
1090 1095 1100 

Lys Asp Gly Leu Pro Gly Arg Val Pro Leu Trp Val He Leu Leu Ser 
1110 1115 1120 



Ala Phe Ala Gly Leu Leu Leu Leu Met Leu Leu He Leu Ala Leu Trp 
1125 1130 1135 
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Lys He Gly Phe Phe Lys Arg Pro Leu Lys Lys Lys Met Glu Lys 
1140 H45 H50 

<210> 104 
<211> 389 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (279) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (302) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (310) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<400> 104 

GCCTCCAGTA TTTTGGCTGC AGCATCCACG GGCAATTGGT ACCTCAATGA GGGATGGGCT 60 
CATCGACCTG GCAGTGGGAG CCCTTGGCAA CGCTGTGATT CTGTGGTCCC GCCCAGTGGT 120 
TCAGATCAAT GCCAGCCTCC ACTTTGAGCC ATCCAAGATC AACATCTTCC ACAGAGACTG 180 
CAAGCGCAGT GGCAGGGATG CCACCTGCCT GGCCGCCTTC CTCTGCTTCA CGCCCATCTT 240 
CCTGGCACCC CATTTNCAAA CAACAACTGT TGGCATCANA TACAACGCCA CCATGGGATG 300 
ANAGGCGGTN TACACCGAGG GCCCACCTGG ACAAGGCGGG GACCGANTNA CAACAGAACC 360 
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GTACTGGTTT TCTTCGGCCA GNAACTTGT 389 



<210> 105 
<211> 491 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (11) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (274) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (290) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (297) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (317) 

<223> n equals a,t,g, or c 
<220> 

<221>- SITE 
<222> (355) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (384) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (398) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (404) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (407) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (413) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (414) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (415) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (434) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (438) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (443) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (462) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (466) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (482) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (483) 

<223> n equals a,t,g, or c 
<400> 105 



GAATTCGGCA NAGGCAGACA CTGCTCCTTG GGTAGCCATA ACTGTGTCCC CACTTTCCAG 


60 


GGTGCCAGAC CTACATGGAC 


ATCGTCATTG TCCTGGATGG GTCCAACAGC ATCTACCCCT 


120 


GGGTGGGGGT TCAGCACTTC 


CTCATCAACA TCCTGAAAAA GTTTTACATT GGCCCAGGGC 


180 


AGATCCAGGT TGGAGTTGTG 


CAGTATGGCG ANGATGTGGT GCATGAGTTT CACCTCAATG 


240 


ACTACAGGTC TGTAAAAGAT 


GTGGTGGNAG CTGNCAGCCA CATTGAGCCN GAGAGGNGGG 


300 


ACAGAGACCC GGACGGNATT 


TGGCATTGGA ATTTGGCACG GTTAAAAAAA AAGTNGGGCC 


360 


AAAAAAATTT TTTTTGGGTC 


NCANATGCTT TTGTAGTNAN TCCNNTNGGC TTNNNAACAA 


420 


ATTTCCCCAA ATTNGGTNCC 


CTNGGGCCAT ATTTTTTTAT TNCCNNNAAT TTTGGGGTTT 


480 


GNNTGGGTCA A 




491 



<210> 106 
<211> 340 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
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PCT/US99/25031 
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<220> 

<221> SITE 
<222> (188) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (292) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (312) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (327) 

<223> n equals a,t,g, or c 
<400> 106 



GGCANAGGAG 


TAAATGAGCG 


GGACAGCACC 


AAGGAAGACA ACGTGGCCCC 


CTTACGCTTC 


60 


CACCTCAAAT 


ACGAGGCTGA 


CGTCCTCTTC 


ACCAGGAGCA 


GCAGCCTGAG 


CCACTACGAA 


120 


GGTCAAGCTC 


AACAGCTCGC 


TGGAGAGATA 


CGATGGTATC 


GGGCCTCCCT 


TCAGCTGCAT 


180 


CTTCAGGNTC 


CAGAACTTGG 


GCTTGTTCCC 


CATCCACGGG 


ATGATGGATG 


GAAGATCACC 


240 


ATTCCCATCG 


NCACCAGGAG 


CGGCAACCGC 


CTTACTGAAG 


TTGAGGGACT 


TNCTCAAGGA 


300 


CGAGGCGNAC 


ANGTCCTGTA AACATTNGGG GCAATAGCAT 






340 



<210> 107 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
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108 

<222> (9) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (84) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (298) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (311) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (312) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (336) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (342) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (343) 

<223> n equals a,t,g, or c 



<400> 107 

GGCANAGGNG AAATCAATTT CCATCTACTG GGGAACCTGT GGTTGAGGTC CCTAAAAGCA 60 

CTCAAGTACA GATCCATGAA AATNATGGTC AACGCAGCCT TGCAGAGGCA GGTTGCACAG 120 

CCCCTTCATC TTCCGTGAAG GAGGTATCCC AGCCGCCAGA TCGTGTTTGA GAATCTCCAA 180 

GCAAGAGGAC TGGCAGGTCC CCATCTGGAT CATTGTAGGC AGCACCCTGG GGGGCCTCCT 240 

ACTGCTGGCC CTGCTGGTTC CTGGGCACTG TGGGAAGCTT CGGTTTCTTT AAGAAGTNTC 300 



CAGGTCGCAG NNGGGGTAGC CTGGTTCTGG GACCCNAANC CNNAAAATTG CTGGGT 



356 
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<210> 108 
<211> 262 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (216) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (234) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (257) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (259) 

<223> n equals a,t,g, or c 



<400> 108 
CTGCACACCG 


GACCCAGCCG 


CCGTGCCGCG 


GGCCATGGTA 


CCTGCCCAGG 


GGCCTGGTGG 


60 


TGGCCTGGGC 


GCTCANCCTG 


TGGCCAGGGT 


TCACGGACAC 


CTTCAACATG 


GTACACCAGG 


120 


AAGCCCCGGG 


TCATCCCTGG 


CTCCAGGACC 


GCCTTCTTTG 


GCTACACAGT 


GCAGCAGCAC 


180 


GACATCAGTG 


GCAATAAGTG 


GCTGGTCCTT 


GGGGCNCCCC 


ACTGGAAACC 


AATNGCTACA 


240 


AAAAAAANGG 


AAAACTNTNC 


AA 








262 



<210> 109 
<211> 327 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (7) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (33) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (47) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (55) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (67) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (98) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (153) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (186) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (259) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (283) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (293) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (316) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (326) 
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<223> n equals a,t,g, or c 



<400> 109 








GGCAGANCTA 


CAGCACGGTC CTAAATATCT CGNAGTCAGC AAACCTNCAG 


TTTCNAGCTT 


60 


AATCCANAAG 


GAGGACTCAA ACGGTAGCAT TGAGTGTNTG AACGAGGAGA 


GGAGGCTCCA 


120 


GAAGCAAGTT 


TGCAACGTCA GCTATCCCTT CTNCCGGGCC AAGGCCAAGG 


TGGCTTTCCG 


180 


TCTTGNATTT 


TAAGTTCAGC AAATCCATCT TCCTACACCA CCTGGAGATC GAGCCTCGCT 


240 


GCAGCAGTGA 


CAGTAATGNG ATGAGGGGTT AGCCGGGGTT TGNACCCAAG 


TGNATTTCAC 


300 


ATTGGCCGGA 


AATTTNAAAA TTTCCNC 




327 



<210> 110 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (72) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (90) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (194) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (241) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (245) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (275) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (290) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (296) 

<223> n equals a,t,g, or c 



<400> 110 

CCCATAGATA GGCCCCTGGG GCTCCTGAAA GAATGAACCC AAGAGCAAGG GCTTAATGGT 60 

AACAGCTGCA NNAAGGGAAT GAAGAAAGAN TCTAAGAATG TGGAGACTGA TGGCCAGGCA 120 

AGTGGGACCA GGATACTGAA CGCTGTCCTG AAGAATGAGA AGGTAGCCGG GCTCTGCACC 180 

CACGTGCATT GCANATTGAA CCGCAACTGA ANACATTCCC CCACCAGCTG CAGCCCCTTG 240 

NTCTNCAGTT GCCAACCCTC CCGGGTGNAA TTTTNTTCCC AGGTACCTTN ATGGGNAAGC 300 

301 



<210> 111 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (96) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (285) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (387) 

<223> n equals a,t,g, or c 



WO 00/29435 



113 



PCI7US99/25031 



<220> 

<221> SITE 
<222> (407) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (433) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (452) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<400> 111 

GGCACAGTGG CCGTGGGCAG GTCACTCCGG GCATCCAACA CAAGGTCAGG GACACAGTGC 60 
TCATCCTCAT TGCAGCCGTT CCAGAAGGGC ACCTGNGCCA GGGAGAGCCA GGTGTGGGCA 120 
GCTGGGTAGG GACCCGCAGC CCCTCGCCCT CAATGTACAC CAGCTCTGTC TCCACCACAC 180 
TAGACATGGG CTGGCTTTCC TGCACTGTCC CCAGACACCA CACTGCTCTG TCTTGTGCTT 240 
TTCCATAGAT GCTTCCCTCT TTAAAACGAT GCTCAAAGCT TCAGNTCCTC CTGGCTCCCC 300 
TCCAGTTTCA TGAATGGAGC TGATGGCACA GAACCCCCAA CCCCATTCAA CCAGCAGANG 360 
GTTCTGGTTC AACATTTATT GATCAANAAT GTGTGTGGGG CAAGGGNTTG GTAATGGGGG 420 
TTACAAAACA AGNAAGTTGG CAGTTTTGGC CNTCAGTTCA AGGGGAAANT GGTGTTTG 478 



<210> 112 
<211> 224 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (21) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (49) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (66) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (176) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (184) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (197) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (221) 

<223> n equals a,t,g, or c 
<400> 112 

CTATTGCCCC ANATGTTACA NGACGTGTTC GCCTCGTCCG TGTAGGAANT CCCTCAGCTT 60 
CAGTANGCGG TTGCCGCTCC TGGTGGCGAG GGAATGGTGA ATCTTCATCA TCATCCCGTG ' 120 
GTATGGGGAA CAAACCCAAG TTCTGGATCC TGAAAAATCA ACTGAAAGGA AGGCCNATAC 180 
ATCNTATCTC TCCAACNAAC TGTTGGCTTG ANCTCCTAAT NGTT 224 



<210> 113 
<211> 126 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (72) 

<223> n equals a,t,g, or c 
<400> 113 

GGCACGAGCT GACATCTCTC TGTCACTCTG TTGCAGGCAC CAACAAGAAC GAGACCTCCT 60 
TTGGGCTGTG GNAGTCACAG ACGGGCTTTT CCTCGCACGT GGTGGAGGTA TGTGGTGGAG 120 



WO 00/29435 PCT/US99/25031 

115 

GTACGG 126 



<210> 114 
<211> 150 
<212> PRT 
<213> Homo sapiens 



<400> 114 

Met Leu Phe Ala Cys Ala Leu Leu Ala Leu Leu Gly Leu Ala Thr Ser 
15 10 15 

Cys Ser Phe lie Val Pro Arg Ser Glu Trp Arg Ala Leu Pro Ser Glu 
20 25 30 

Cys Ser Ser Arg Leu Gly His Pro Val Arg Tyr Val Val He Ser His 
35 40 45 

Thr Ala Gly Ser Phe Cys Asn Ser Pro Asp Ser Cys Glu Gin Gin Ala 
50 55 60 

Arg Asn Val Gin His Tyr His Lys Asn Glu Leu Gly Trp Cys Asp Val 
65 70 75 80 

Ala Tyr Asn Phe Leu He Gly Glu Asp Gly His Val Tyr Glu Gly Arg 
85 90 95 

Gly Trp Asn He Lys Gly Asp His Thr Gly Pro He Trp Asn Pro Met 
100 105 HO 

Ser He Gly He Thr Phe Met Gly Asn Phe Met Asp Arg Val Arg Lys 
115 120 125 

Ala Gly Pro Pro Cys Cys Pro Lys Ser Ser Gly lie Trp Gly Val Ser 
130 135 140 



Gly Leu Pro Glu He Gin 
145 150 



<210> 115 
<211> 285 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (263) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 



WO 00/29435 



PCT/US99/25031 



116 

<222> (270) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (273) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (276) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (283) 

<223> n equals a,t,g, or c 
<400> 115 

ACCCCAGGCC ATCCGGGNAG CCCAGGGTCT ACTGGCCTGC GGTGTGGCTC AGGGAGCCTG * 60 

AGGTCCAACT ATGTGCTCAA AGGACACCGG GATGTGCAGC GTACACTCTC TCCAGGCAAC 120 
CAGCTCTACC ACCTCATCCA GAATTGGCCA CACTACCGCT CCCCCTGAGG CCCTGCTGAT 180 
CCGCACCCCA TTCCTCCCCT CCCATGGCCA AAAACCCCAC TGTCTCCTTC TCCAATAAAG 240 
ATGTAGCTCA AAAAAAAAAA AANAAAAAAN CCNGGNGGGG GGNCC 285 



<210> 116 

<211> 457 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (27) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
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<222> (70) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (75) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (80) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (110) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (173) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (176) 

<223> n equals a,t,g ( or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (237) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (300) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (310) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (318) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (320) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (326) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (337) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (396) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (399) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (436) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (439) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (451) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (453) 

<223> n equals a,t,g, or c 
<400> 116 

GGCANAGCNG GACCCTGCCG CCCTGCNACT ATGTCCCGCC GCTCTATGCT GCTTGCNTGG 60 
GCTCTCCCCN GCCTNCTTCN ACTCGAAGCG GCTCAGGAGA CAGAAGACCN GGCCTGCTGC 120 
AGCCCCATAG TGCCCCGGAA ACAGTGGAAG GCCCTGGTAT CAAAGTGCGC CCNGCNCTGA 180 



AGACCTGCCC TTACGCTATG TGGTGGTATC GNACACGGCG GGCAGCAGCT GCAACANCCC 



240 



WO 00/29435 PCT/US99/25031 
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CGATTTGTTG CCAGCAGCAA GCCCGGAATG TGCAGCACTA CCCACATGAA GACACTGGGN 300 
TGGTGCGAAN GTGGGGTNAN AAATTNCTGA TTTGGGNGAA AGAAGGGNTT CTTATAAGAG 360 
GGGGCCTTTG GTTTGGGAAT TTAAAGGGTT GCCCCNTTNA GGTGAATTTT TGGGAAACCC 420 
TTGTTCCATT GGGATNAANT TTCNTGGGCA NTNACAT 457 



<210> 117 
<211> 1833 
<212> DNA 

<213> Homo sapiens 
* <400> 117 

^ACATCCATGG CTCTAATGCT CAGTTTGGTT CTGAGTCTCC TCAAGCTGGG ATCAGGGCAG 60 

TGGCAGGTGT TTGGGCCAGA CAAGCCTGTC CAGGCCTTGG TGGGGGAGGA CGCAGCATTC 120 

TCCTGTTTCC TGTCTCCTAA GACCAATGCA GAGGCCATGG AAGTGCGGTT CTTCAGGGGC 180 

CAGTTCTCTA GCGTGGTCCA CCTCTACAGG GACGGGAAGG ACCAGCCATT TATGCAGATG 240 

CCACAGTATC AAGGCAGGAC AAAACTGGTG AAGGATTCTA TTGCGGAGGG GCGCATCTCT 300 

CTGAGGCTGG AAAACATTAC TGTGTTGGAT GCTGGCCTCT ATGGGTGCAG GATTAGTTCC 360 

CAGTCTTACT ACCAGAAGGC CATCTGGGAG CTACAGGTGT CAGCACTGGG CTCAGTTCCT 420 

CTCATTTCCA TCACGGGATA TGTTGATAGA GACATCCAGC TACTCTGTCA GTCCTCGGGC 480 

TGGTTCCCCC GGCCCACAGC GAAGTGGAAA GGTCCACAAG GACAGGATTT GTCCACAGAC 540 

TCCAGGACAA ACAGAGACAT GCATGGCCTG TTTGATGTGG AGATCTCTCT GACCGTCCAA 600 

^GAGAACGCCG GGAGCATATC CTGTTCCATG CGGCATGCTC ATCTGAGCCG AGAGGTGGAA 660 

.TCCAGGGTAC AGATAGGAGA TACCTTTTTC GAGCCTATAT CGTGGCACCT GGCTACCAAA 720 

GTACTGGGAA TACTCTGCTG TGGCCTATTT TTTGGCATTG TTGGACTGAA GATTTTCTTC 780 

TCCAAATTCC AGTGGAAAAT CCAGGCGGAA CTGGACTGGA GAAGAAAGCA CGGACAGGCA 840 

GAATTGAGAG ACGCCCGGAA ACACGCAGTG GAGGTGACTC TGGATCCAGA GACGGCTCAC 900 

CCGAAGCTCT GCGTTTCTGA TCTGAAAACT GTAACCCATA GAAAAGCTCC CCAGGAGGTG 960 

CCTCACTCTG AGAAGAGATT TACAAGGAAG AGTGTGGTGG CTTCTCAGAG TTTCCAAGCA 1020 

GGGAAACATT ACTGGGAGGT GGACGGAGGA CACAATAAAA GGTGGCGCGT GGGAGTGTGC 1080 

CGGGATGATG TGGACAGGAG GAAGGAGTAC GTGACTTTGT CTCCCGATCA TGGGTACTGG 1140 

GTCCTCAGAC TGAATGGAGA ACATTTGTAT TTCACATTAA ATCCCCGTTT TATCAGCGTC 1200 

TTCCCCAGGA CCCCACCTAC AAAAATAGGG GTCTTCCTGG ACTATGAGTG TGGGACCATC 1260 



WO 00/29435 PCTAJS99/25031 

120 

TCCTTCTTCA ACATAAATGA CCAGTCCCTT ATTTATACCC TGACATGTCG GTTTGAAGGC 1320 

TTATTGAGGC CCTACATTGA GTATCCGTCC TATAATGAGC AAAATGGAAC TCCCAGAGAC 1380 

AAGCAACAGT GAGTCCTCCT CACAGGCAAC CACGCCCTTC CTCCCCAGGG GTGAAATGTA 1440 

GGATGAATCA CATCCCACAT TCTTCTTTAG GGATATTAAG GTCTCTCTCC CAGATCCAAA 1500 

GTCCCGCAGC AGCCGGCCAA GGTGGCTTCC AGATGAAGGG GGACTGGCCT GTCCACATGG 1560 

GAGTCAGGTG TCATGGCTGC CCTGAGCTGG GAGGGAAGAA GGCTGACATT ACATTTAGTT 1620 

TGCTCTCACT CCATCTGGCT AAGTGATCTT GAAATACCAC CTCTCAGGTG AAGAACCGTC 1680 

AGGAATTCCC ATCTCACAGG CTGTGGTGTA GATTAAGTAG ACAAGGAATG TGAATAATGC 1740 

TTAGATCTTA TTGATGACAG AGTGTATCCT AATGGTTTGT TCATTATATT ACACTTTCAG " 1800 

TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1833 



<210> 118 

<211> 1386 

<212> DNA 

<213> Homo sapiens 

<400> 118 

ATGGCTCTAA TGCTCAGTTT GGTTCTGAGT CTCCTCAAGC TGGGATCAGG GCAGTGGCAG 60 

GTGTTTGGGC CAGACAAGCC TGTCCAGGCC TTGGTGGGGG AGGACGCAGC ATTCTCCTGT 120 

TTCCTGTCTC CTAAGACCAA TGCAGAGGCC ATGGAAGTGC GGTTCTTCAG GGGCCAGTTC 180 

TCTAGCGTGG TCCACCTCTA CAGGGACGGG AAGGACCAGC CATTTATGCA GATGCCACAG 240 

TATCAAGGCA GGACAAAACT GGTGAAGGAT TCTATTGCGG AGGGGCGCAT CTCTCTGAGG 300 

CTGGAAAACA TTACTGTGTT GGATGCTGGC CTCTATGGGT GCAGGATTAG TTCCCAGTCT 360 

TACTACCAGA AGGCCATCTG GGAGCTACAG GTGTCAGCAC TGGGCTCAGT TCCTCTCATT 420 

TCCATCACGG GATATGTTGA TAGAGACATC CAGCTACTCT GTCAGTCCTC GGGCTGGTTC 480 

CCCCGGCCCA CAGCGAAGTG GAAAGGTCCA CAAGGACAGG ATTTGTCCAC AGACTCCAGG 540 

ACAAACAGAG ACATGCATGG CCTGTTTGAT GTGGAGATCT CTCTGACCGT CCAAGAGAAC 600 

GCCGGGAGCA TATCCTGTTC CATGCGGCAT GCTCATCTGA GCCGAGAGGT GGAATCCAGG 660 

GTACAGATAG GAGATACCTT TTTCGAGCCT ATATCGTGGC ACCTGGCTAC CAAAGTACTG 720 

GGAATACTCT GCTGTGGCCT ATTTTTTGGC ATTGTTGGAC TGAAGATTTT CTTCTCCAAA 780 

TTCCAGTGGA AAATCCAGGC GGAACTGGAC TGGAGAAGAA AGCACGGACA GGCAGAATTG 840 

AGAGACGCCC GGAAACACGC AGTGGAGGTG ACTCTGGATC CAGAGACGGC TCACCCGAAG 900 
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CTCTGCGTTT CTGATCTGAA AACTGTAACC CATAGAAAAG CTCCCCAGGA GGTGCCTCAC 960 

TCTGAGAAGA GATTTACAAG GAAGAGTGTG GTGGCTTCTC AGAGTTTCCA AGCAGGGAAA 1020 

CATTACTGGG AGGTGGACGG AGGACACAAT AAAAGGTGGC GCGTGGGAGT GTGCCGGGAT 1080 

GATGTGGACA GGAGGAAGGA GTACGTGACT TTGTCTCCCG ATCATGGGTA CTGGGTCCTC 1140 

AGACTGAATG GAGAACATTT GTATTTCACA TTAAATCCCC GTTTTATCAG CGTCTTCCCC 1200 

AGGACCCCAC CTACAAAAAT AGGGGTCTTC CTGGACTATG AGTGTGGGAC CATCTCCTTC 1260 

TTCAACATAA ATGACCAGTC CCTTATTTAT ACCCTGACAT GTCGGTTTGA AGGCTTATTG 1320 

AGGCCCTACA TTGAGTATCC GTCCTATAAT GAGCAAAATG GAACTCCCAG AGACAAGCAA 1380 

CAGTGA 1386 



<210> 119 
<211> 461 
<212> PRT 
<213> Homo sapiens 

<400> 119 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
15 10 15 

Gly Gin Trp Gin Val Phe Gly Pro Asp Lys Pro Val Gin Ala Leu Val 
20 25 30 

Gly Glu Asp Ala Ala Phe Ser Cys. Phe Leu Ser Pro Lys Thr Asn Ala 
35 40 45 

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gin Phe Ser Ser Val Val 
50 55 60 

His Leu Tyr Arg Asp Gly Lys Asp Gin Pro Phe Met Gin Met Pro Gin 
65 70 75 80 

Tyr Gin Gly Arg Thr Lys Leu Val Lys Asp Ser lie Ala Glu Gly Arg 
85 90 95 

lie Ser Leu Arg Leu Glu Asn lie Thr Val Leu Asp Ala Gly Leu Tyr 
100 105 110 

Gly Cys Arg lie Ser Ser Gin Ser Tyr Tyr Gin Lys Ala He Trp Glu 
115 120 125 

Leu Gin Val Ser Ala Leu Gly Ser Val Pro Leu He Ser He Thr Gly 
130 135 140 

Tyr Val Asp Arg Asp He Gin Leu Leu Cys Gin Ser Ser Gly Trp Phe 
145 150 155 160 

Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gin Gly Gin Asp Leu Ser 
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165 170 175 

Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu 
180 185 190 

lie Ser Leu Thr Val Gin Glu Asn Ala Gly Ser He Ser Cys Ser Met 
195 200 205 

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gin He Gly 
210 215 220 

Asp Thr Phe Phe Glu Pro He Ser Trp His Leu Ala Thr Lys Val Leu 
225 230 235 240 

Gly He Leu Cys Cys Gly Leu Phe Phe Gly He Val Gly Leu Lys He 
245 250 255 

Phe Phe Ser Lys Phe Gin Trp Lys He Gin Ala Glu Leu Asp Trp Arg 
260 265 270 

Arg Lys His Gly Gin Ala Glu Leu Arg Asp Ala Arg Lys His Ala Val 
275 280 285 

Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys Leu Cys Val Ser 
290 295 300 

Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro Gin Glu Val Pro His 
305 310 315 320 

Ser Glu Lys Arg Phe Thr Arg Lys Ser Val Val Ala Ser Gin Ser Phe 
325 330 335 

Gin Ala Gly Lys His Tyr Trp Glu Val Asp Gly Gly His Asn Lys Arg 
340 345 350 

Trp Arg Val Gly Val Cys Arg Asp Asp Val Asp Arg Arg Lys Glu Tyr 
355 360 365 

Val Thr Leu Ser Pro Asp His Gly Tyr Trp Val Leu Arg Leu Asn Gly 
370 375 380 

Glu His Leu Tyr Phe Thr Leu Asn Pro Arg Phe He Ser Val Phe Pro 
385 390 395 400 

Arg Thr Pro Pro Thr Lys He Gly Val Phe Leu Asp Tyr Glu Cys Gly 
405 410 415 

Thr He Ser Phe Phe Asn He Asn Asp Gin Ser Leu lie Tyr Thr Leu 
420 425 430 

Thr Cys Arg Phe Glu Gly Leu Leu Arg Pro Tyr He Glu Tyr Pro Ser 
435 440 445 

Tyr Asn Glu Gin Asn Gly Thr Pro Arg Asp Lys Gin Gin 
450 455 460 



<210> 120 
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<211> 1152 

<212> DNA 

<213> Homo sapiens 

<400> 120 

ACCTTTTTCG AGCCTATATC GTGGCACCTG GCTACCAAAG TACTGGGAAT ACTCTGCTGT 60 

GGCCTATTTT TTGGCATTGT TGGACTGAAG ATTTTCTTCT CCAAATTCCA GTGGAAAATC 120 

CAGGCGGAAC TGGACTGGAG AAGAAAGCAC GGACAGGCAG AATTGAGAGA CGCCCGGAAA 180 

CACGCAGTGG AGGTGACTCT GGATCCAGAG ACGGCTCACC CGAAGCTCTG CGTTTCTGAT 240 

CTGAAAACTG TAACCCATAG AAAAGCTCCC CAGGAGGTGC CTCACTCTGA GAAGAGATTT 300 

ACAAGGAAGA GTGTGGTGGC TTCTCAGAGT TTCCAAGCAG GGAAACATTA CTGGGAGGTG 360 

GACGGAGGAC ACAATAAAAG GTGGCGCGTG GGAGTGTGCC GGGATGATGT GGACAGGAGG 420 

AAGGAGTACG TGACTTTGTC TCCCGATCAT GGGTACTGGG TCCTCAGACT GAATGGAGAA 480 

CATTTGTATT TCACATTAAA TCCCCGTTTT ATCAGCGTCT TCCCCAGGAC CCCACCTACA 540 

AAAATAGGGG TCTTCCTGGA CTATGAGTGT GGGACCATCT CCTTCTTCAA CATAAATGAC 600 

CAGTCCCTTA TTTATACCCT GACATGTCGG TTTGAAGGCT TATTGAGGCC CTACATTGAG 660 

TATCCGTCCT ATAATGAGCA AAATGGAACT CCCAGAGACA AGCAACAGTG AGTCCTCCTC 720 

ACAGGCAACC ACGCCCTTCC TCCCCAGGGG TGAAATGTAG GATGAATCAC ATCCCACATT 780 

CTTCTTTAGG GATATTAAGG TCTCTCTCCC AGATCCAAAG TCCCGCAGCA GCCGGCCAAG 840 

GTGGCTTCCA GATGAAGGGG GACTGGCCTG TCCACATGGG AGTCAGGTGT CATGGCTGCC 900 

CTGAGCTGGG AGGGAAGAAG GCTGACATTA CATTTAGTTT GCTCTCACTC CATCTGGCTA 960 

AGTGATCTTG AAATACCACC TCTCAGGTGA AGAACCGTCA GGAATTCCCA TCTCACAGGC 1020 

TGTGGTGTAG ATTAAGTAGA CAAGGAATGT GAATAATGCT TAGATCTTAT TGATGACAGA 1080 

GTGTATCCTA ATGGTTTGTT CATTATATTA CACTTTCAGT AAAAAAAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AA 1152 



<210> 121 
<211> 526 
<212> PRT 
<213> Homo sapiens 

<400> 121 

Met Ala Val Phe Pro Asn Ser Cys Leu Ala Gly Cys Leu Leu lie Phe 
15 10 15 

He Leu Leu Gin Leu Pro Lys Leu Asp Ser Ala Pro Phe Asp Val He 
20 25 30 
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Gly Pro Gin Glu Pro He Leu Ala Val Val Gly Glu Asp Ala Glu Leu 
35 40 45 

Pro Cys Arg Leu Ser Pro Asn Val Ser Ala Lys Gly Met Glu Leu Arg 
50 55 60 

Trp Phe Arg Glu Lys Val Ser Pro Ala Val Phe Val Ser Arg Glu Gly 
65 70 75 80 

Gin Glu Gin Glu Gly Glu Glu Met Ala Glu Tyr Arg Gly Arg Val Ser 
85 90 95 

Leu Val Glu Asp His He Ala Glu Gly Ser Val Ala Val Arg He Gin 
100 105 no 

Glu Val Lys Ala Ser Asp Asp Gly Glu Tyr Arg Cys Phe Phe Arg Gin 
115 120 125 

Asp Glu Asn Tyr Glu Glu Ala He Val His Leu Lys Val Ala Ala Leu 
130 135 140 

Gly Ser Asp Pro His He Ser Met Lys Val Gin Glu Ser Gly Glu He 
145 150 155 160 

Gin Leu Glu Cys Thr Ser Val Gly Trp Tyr Pro Glu Pro Gin Val Gin 
165 170 175 

Trp Arg Thr His Arg Gly Glu Glu Phe Pro Ser Met Ser Glu Ser Arg 
180 185 190 

Asn Pro Asp Glu Glu Gly Leu Phe Thr Val Arg Ala Ser Val He He 
195 200 205 

Arg Asp Ser Ser Met Lys Asn Val Ser Cys Cys He Arg Asn Leu Leu 
210 215 220 

Leu Gly Gin Glu Lys Asp Val Glu Val Ser He Pro Ala Ser Phe Phe 
225 230 235 .240 

Pro Arg Leu Thr Pro Trp Met Val Ala Val Ala Val He Leu Val Val 
245 250 255 

Leu Gly Leu Leu Thr He Gly Ser He Phe Phe Thr Trp Arg Leu Tyr 
260 265 270 

Lys Glu Arg Ser Arg Gin Arg Arg Asn Glu Phe Ser Ser Lys Glu Lys 
275 280 285 

Leu Leu Glu Glu Leu Lys Trp Lys Arg Ala Thr Leu His Ala Val Asp 
290 295 300 

Val Thr Leu Asp Pro Asp Thr Ala His Pro His Leu Phe Leu Tyr Glu 
305 310 315 320 

Asp Ser Lys Ser Val Arg Leu Glu Asp Ser Arg Gin Lys Leu Pro Glu 
325 330 335 
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Lys Pro Glu Arg Phe Asp Ser Trp Pro Cys Val Met Gly Arg Glu Ala 
340 345 350 

Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu Val Gly Asp Arg Thr 
355 360 365 

Asp Trp Ala He Gly Val Cys Arg Glu Asn Val Met Lys Lys Gly Phe 
370 375 380 

Asp Pro Met Thr Pro Glu Asn Gly Phe Trp Ala Val Glu Leu Tyr Gly 
385 390 395 400 

Asn Gly Tyr Trp Ala Leu Thr Pro Leu Arg Thr Pro Leu Pro Leu Ala 
405 410 415 

Gly Pro Pro Arg Arg Val Gly Val Phe Leu Asp Tyr Glu Ser Gly Asp 
420 425 430 

He Phe Phe Tyr Asn Met Thr Asp Gly Ser His He Tyr Thr Phe Ser 
435 440 445 

Lys Ala Ser Phe Ser Gly Pro Leu Arg Pro Phe Phe Cys Leu Trp Ser 
450 455 460 

Cys Gly Lys Lys Pro Leu Thr He Cys Pro Val Thr Asp Gly Leu Glu 
465 470 475 480 

Gly Val Met Val Val Ala Asp Ala Lys Asp He Ser Lys Glu He Pro 
485 490 495 

Leu Ser Pro Met Gly Glu Asp Ser Ala Ser Gly Asp He Glu Thr Leu 
500 505 510 

His Ser Lys Leu He Pro Leu Gin Pro Ser Gin Gly Val Pro 
515 520 525 



<210> 122 

<211> 469 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (63) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (289) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (357) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (358) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (437) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (447) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (458) 

<223> n equals a,t,g, or c 
<400> 122 

GGCAGAGCCT GTCATCCGTT TCCATGCCGT GAGGTCCATT CACAGAACAC ATCCATGGCT 60 



CTNATGCTCA GTTTGGTTCT GAGTCTCCTC AAGCTGGGAT CAGGGCAGTG GCAGGTGTTT 



120 
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GGGCCAGACA AGCCTGTCCA GGCCTTGGTG GGGGAGGACG CAGCATTCTC CTGTTTCCTG 180 

TCTCCTAAGA ACCAATGCAG AGGCCATGGT AAGTGCGGTT CTTCAGGGGC CAGTTTCTCT 240 

AGCGTGGTCC ACCTCTTACA GGGACGGGAA GGACCAGCCC ATTTTATGNC AGATGGCCAC 300 

AGTATTCAAG GCCAGGACAA AAAATGGGTG GAAGGTTNCT ATTTGCGGAG GGGCGCNNTT 360 

TNTNTTGAGG GTGGGAAAAA NATTTACTNT NTTTGGATTG CTGGGCCCCT TATGGGTGCC 420 

AGGGTTAAGT TTCCCANTTT TTAATTNACC CGANAGGNCC ATTTGGGGG 469 



<210> 123 
<211> 368 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> SITE 

<222> (81) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (133) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (210) 

*--<223> n equals a,t,g, or c 

^220> 

<221> SITE 

<222> (228) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (242) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (245) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (249) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (329) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (335) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (353) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c 
<400> 123 

GGCAGAGAGG ATTTGTCCAC AGACTCCAGG ACAAACAGAG ACATGCATGG CCTGTTTGAT 60 
GTGGAGATCT CTCTGACCGT NCAAGAGAAC GCCGGGAGCA TATCCTGTNC CATGCGGCAT 120 
GCTCATCTAA GCNCGAGAAG GTGGGAATCC AGGGTACAGA TAGGAGATAC CTTTTCGAAG 180 
CCTATATCGT GGCACCTGGC TACCAAAGTN CTGGGAAATA CTCTGCTNGT GGCCTATTTT 240 
TNGGNATTNT NGGACTAAAG ATATTTTNCT CCAAATTCCA GTGTAAGCAA GGGAGAAGGG 300 
GCATGGGCCC GTGCCTTATT CCATGGTTNC CAGCNGGGNC AGGATCAGAG ATNGCTCCCA 360 
CNTCCAGT o fi o 



<210> 124 
<211> 27 
<212> PRT 

<213> Homo sapiens 
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<400> 124 

Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys lie 
1 5 10 is 

His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gin 
20 25 



<210> 125 
<211> 413 
<212> PRT 
<213> Homo sapiens 

<400> 125 

"■'Gly Thr Arg Gly Gin Ala Trp Glu Pro Arg Ala Leu Ser Arg Arg Pro 
1 5 io 15 

His Leu Ser Glu Arg Arg Ser Glu Pro Arg Pro Gly Arg Ala Ala Arg 
20 25 30 

Arg Gly Thr Val Leu Gly Met Ala Gly He Pro Gly Leu Leu Phe Leu 
35 40 45 

Leu Phe Phe Leu Leu Cys Ala Val Gly Gin Val Ser Pro Tyr Ser Ala 
50 55 60 

Pro Trp Lys Pro Thr Trp Pro Ala Tyr Arg Leu Pro Val Val Leu Pro 
65 70 75 80 

Gin Ser Thr Leu Asn Leu Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys 
85 90 95 

Leu Glu Val Ser Ser Ser Cys Gly Pro Gin Cys His Lys Gly Thr Pro 
100 105 no 

Leu Pro Thr Tyr Glu Glu Ala Lys Gin Tyr Leu Ser Tyr Glu Thr Leu 
115 120 125 

Tyr Ala Asn Gly Ser Arg Thr Glu Thr Gin Val Gly He Tyr He Leu 
130 135 140 

Ser Ser Ser Gly Asp Gly Ala Gin His Arg Asp Ser Gly Ser Ser Gly 
145 150 155 160 

Lys Ser Arg Arg Lys Arg Gin He Tyr Gly Tyr Asp Ser Arg Phe Ser 
165 170 175 

He Phe Gly Lys Asp Phe Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val 
180 185 190 

Lys Leu Ser Thr Gly Cys Thr Gly Thr Leu Val Ala Glu Lys His Val 
195 200 205 

Leu Thr Ala Ala His Cys He His Asp Gly Lys Thr Tyr Val Lys Gly 
210 215 220 

Thr Gin Lys Leu Arg Val Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly 
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225 



230 



235 



240 



Gly Arg Gly Ala Asn Asp Ser Thr Ser Ala Met Pro Glu Gin Met Lys 
245 250 255 

Phe Gin Trp He Arg Val Lys Arg Thr His Val Pro Lys Gly Trp He 
260 265 270 

Lys Gly Asn Ala Asn Asp He Gly Met Asp Tyr Asp Tyr Ala Leu Leu 
275 280 285 

Glu Leu Lys Lys Pro His Lys Arg Lys Phe Met Lys He Gly Val Ser 
290 295 300 

Pro Pro Ala Lys Gin Leu Pro Gly Gly Arg He His Phe Ser Gly Tyr 
305 310 315 320 

Asp Asn Asp Arg Pro Gly Asn Leu Val Tyr Arg Phe Cys Asp Val Lys 
325 330 335 

Asp Glu Thr Tyr Asp Leu Leu Tyr Gin Gin Cys Asp Ser Gin Pro Gly 
340 345 350 

Ala Ser Gly Ser Gly Val Tyr Val Arg Met Trp Lys Arg Gin His Gin 
355 360 365 

Lys Trp Glu Arg Lys He He Gly Met He Ser Gly His Gin Trp Val 
370 375 380 

Asp Met Asp Gly Ser Pro Gin Glu Phe Thr Arg Gly Cys Ser Glu He 
385 390 395 400 

Thr Pro Leu Gin Tyr He Pro Asp He Ser He Gly Val 



<210> 126 
<211> 383 
<212> PRT 

<213> Homo sapiens 
<400> 126 

Met Ala Gly He Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys 
15 10 15 

Ala Val Gly Gin Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr Trp 
20 25 30 

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gin Ser Thr Leu Asn Leu 
35 40 45 

Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys Leu Glu Val Ser Ser Ser 
50 55 60 

Cys Gly Pro Gin Cys His Lys Gly Thr Pro Leu Pro Thr Tyr Glu Glu 
65 70 75 80 

Ala Lys Gin Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg 



405 



410 
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85 90 95 

Thr Glu Thr Gin Val Gly He Tyr He Leu Ser Ser Ser Gly Asp Gly 
100 105 no 

Ala Gin His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys Arg 
H5 120 125 

Gin He Tyr Gly Tyr Asp Ser Arg Phe Ser He Phe Gly Lys Asp Phe 
130 135 140 

Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys 
145 150 155 160 

.Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys 
165 170 175 

lie His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gin Lys Leu Arg Val 
180 185 190 

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp 
195 200 205 

Ser Thr Ser Ala Met Pro Glu Gin Met Lys Phe Gin Trp He Arg Val 
210 215 220 

Lys Arg Thr His Val Pro Lys Gly Trp He Lys Gly Asn Ala Asn Asp 
225 230 235 240 

He Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His 
245 250 255 

Lys Arg Lys Phe Met Lys He Gly Val Ser Pro Pro Ala Lys Gin Leu 
260 265 270 

Pro Gly Gly Arg He His Phe Ser Gly Tyr Asp Asn Asp Arg Pro Gly 
275 280 285 

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu 
290 295 300 

Leu Tyr Gin Gin Cys Asp Ala Gin Pro Gly Ala Ser Gly Ser Gly Val 
305 310 315 320 

Tyr Val Arg Met Trp Lys Arg Gin Gin Gin Lys Trp Glu Arg Lys He 
325 330 335 

He Gly He Phe Ser Gly His Gin Trp Val Asp Met Asn Gly Ser Pro 
340 345 350 

Gin Asp Phe Asn Val Ala Val Arg He Thr Pro Leu Lys Tyr Ala Gin 
355 360 365 

He Cys Tyr Trp He Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly 
370 375 380 



<210> 127 
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<211> 269 
<212> PRT 

<213> Homo sapiens 
<400> 127 

Met He Arg Thr Leu Leu Leu Ser Thr Leu Val Ala Gly Ala Leu Ser 
15 io 15 

Cys Gly Asp Pro Thr Tyr Pro Pro Tyr Val Thr Arg Val Val Gly Gly 
20 25 30 

Glu Glu Ala Arg Pro Asn Ser Trp Pro Trp Gin Val Ser Leu Gin Tyr 
35 40 45 

Ser Ser Asn Gly Lys Trp Tyr His Thr Cys Gly Gly Ser Leu He Ala 
50 55 60 

Asn Ser Trp Val Leu Thr Ala Ala His Cys He Ser Ser Ser Arg Thr 
65 70 75 80 

Tyr Arg Val Gly Leu Gly Arg His Asn Leu Tyr Val Ala Glu Ser Gly 
85 90 95 

Ser Leu Ala Val Ser Val Ser Lys He Val Val His Lys Asp Trp Asn 
100 105 no 

Ser Asn Gin He Ser Lys Gly Asn Asp He Ala Leu Leu Lys Leu Ala 
115 120 125 

Asn Pro Val Ser Leu Thr Asp Lys lie Gin Leu Ala Cys Leu Pro Pro 
130 135 140 

Ala Gly Thr He Leu Pro Asn Asn Tyr Pro Cys Tyr Val Thr Gly Trp 
145 150 155 160 

Gly Arg Leu Gin Thr Asn Gly Ala Val Pro Asp Val Leu Gin Gin Gly 
165 170 175 

Arg Leu Leu Val Val Asp Tyr Ala Thr Cys Ser Ser Ser Ala Trp Trp 
180 185 190 

Gly Ser Ser Val Lys Thr Ser Met He Val Ala Gly Gly Asp Gly Val 
195 200 205 

He Ser Ser Cys Asn Gly Asp Ser Gly Gly Pro Leu Asn Cys Gin Ala 
210 215 220 

Ser Asp Gly Arg Trp Gin Val His Gly He Val Ser Phe Gly Ser Arg 
225 230 235 240 

Leu Gly Cys Asn Tyr Tyr His Lys Pro Ser Val Phe Thr Arg Val Ser 
245 250 255 

Asn Tyr He Asp Trp He Asn Ser Val He Ala Asn Asn 
260 265 



<210> 128 
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<211> 328 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (30) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (216) 

<223> n equals a,t,g, or c 
<400> 128 

TGAAACGCAC CCATGTGCCC AAGGGTTGGN TCAAGGGCAA TGCCAATGAC ATCGGCATGG 60 
ATTATGATTA TGCCCTCCTG GAACTCAAAA AGCCCCACAA GAGAAAATTT ATGAAGATTG 120 
GGGTGAGCCC TCCTGCTAAG CAGCTGCCAG GGGGCAGAAT TCACTTCTCT GGTTATGACA 180 
ATGACCGACC AGGCAAT7TG GTGTATCGCT TCTGTNACGT CAAAGACGAG ACCTATGACT 240 
TGCTCTACCA GCAATGCGAT GCCCAGCCAG GGGCCAGCGG GTCTGGGGTC TATGTTAGGA 300 
TGTGGGAGAG ACAGCAGCAT GAAGTGGG 328 



<210> 129 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (23) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (291) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (305) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (319) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (331) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (352) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t>g, or c 
<220> 

<221> SITE 
<222> (390) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (402) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (429) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (461) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (463) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (470) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (476) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (482) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (489) 

<223> n equals a,t,g, or c 
<400> 129 

AATTCGGCAC AGGCTCGGCC CGNAGGACTC AGGGGCTTCA GGAAAGTCTC GAAGGAAGCG 60 
GCAGATTTAT GGCTATGACA GCAGGTTCAG CATTTTTGGG AAGGACTTCC TGCTCAACTA 120 
CCCTTTCTCA ACATCAGTGA AGTTATCCAC GGGCTGCACC GGCACCCTGG TGGGCAGAGA 180 
AGCATGTCCT CACAGCTGCC CACTGCATAC ACGATGGAAA AACCTATGTG AAAGGAACCC 240 
AGAAGCTTCG AGTGGGCTTN CTAAAGCCCA AGTTTAAAGA TGGTGGTCGA NGGGNCAACG 300 
ACTTNCACTT CAGCCATGNC CGAGCAGATG NATTTTCAGT GGGTCCGGTG GNAAGGAACC 360 
CTGNTGCCCA AGGGTTTGGG TTCAGGGGGN ATGGCATGAN CNAGGNATGG GTTTATGATT 420 
ATGCCTCNNG GGAATTCAAA AGGCCCANAA GGGGAAATTT NTNANGGTNN GGGTGNGGCC 480 
TNCTGTTANG AAATT 495 
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<210> 130 
<211> 288 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (24) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (31) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (102) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (200) 

<223> n equals a,t,g, or c 
<400> 130 

CCCACACCTG TNTGAGCGGC GCANGAGCCG NGGCCCGGGC GGGCTGCTCG GCGCGGAACA 60 
GTGCTCGGNA TGGCAGGGAT TCCAGGGCTC CTCTTCCTTC TNTTCTTTCT GCTCTGTGCT 120 
GTTGGGCAAG TAAGCCCTTA CAGTGCCCCC TGGAAACCCA CTTGGCCTGC ATACCGCCTC 180 
CCTGTCGTCT TGCCCCAGTN TACCCTCAAT TTAGCCAAGC CAGACTTTGG AGCCGAAGCC 240 
AAATTAGAAG TATCTTCTTC ATGTGGACCC CAGTGTCATA AGGGAACT 288 



<210> 131 
<211> 423 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
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<220> 

<221> SITE 
<222> (49) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (61) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (79) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (90) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (224) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (272) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (300) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (315) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (363) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 



WO 00/29435 

<222> (375) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (392) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (396) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<400> 131 



GGCANAGGGT 


GGTCGAGGGG 


CCAACGACTC 


CACTTCAGCC 


ATGCCCGANC 


AGATGAAATT 


60 


NCAGTGGATC 


CGGGTGAANG 


CACCCATGTN 


GCCAAGGGTT 


GGATCAAGGG 


CAATGCCAAT 


120 


GAACATCGGC 


ATGGATTATG 


AATTATGCCC 


CTCCTGGGAA 


CTGCAAAAAG 


CCCCACAAGA 


180 


GAAAATTTAT 


GAAAGATTGG 


GGTGGAGCCC 


TCCTGCTTAA 


GCANCTGCCA 


GGGGGCAGAA 


240 


TTGCACTTCT 


NTGGTTATGA ACAATGAACC GNCCAGGGCA ATTTGGTGTG 


ATCGCTTCTN 


300 


TGAACGTNCA 


AAGANGAGGA 


CCTATGGACT 


TGGTTCTTAC 


CCAGCAATGG 


GGATTGCCCC 


360 


AGNCCAGGGG 


GCCANCGGGG 


TTTGGGGGGT 


TNTTTNTTTT 


AAGGNTTTTT 


GGGAAGGGGG 


420 


CCA 












423 
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<210> 132 

<211> 145 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (66) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (104) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (112) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (134) 

<223> n equals a,t,g, or c 



<400> 132 

TTGCATAGAA ATAAAAAAAA TACTGATTTG GGGCAATGAG GAATATTTGA CAATTAAGTT 
AATCTNCACG TTTTTNCAAA CTTTGGATTT TTATTTCATC TGANCTTGTT TNAAAGATTT 
ATATTAAATA TTTNGCATAC AAGAG 



<210> 133 
<211> 340 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (313) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (314) 

<223> n equals a,t,g, or c 



<400> 133 

CAGAAGTGGG AGCGAAAAAT TATTGGCATT TTTTCAGGGC ACCAGTGGGT GGACATGAAT 
GGTTCCCCAC AGGATTTCAA CGTGGCTGTC AGAATCACTC CTCTCAAATA TGCCCAGATT 
TGCTATTGGA TTAAAGGAAA CTACCTGGAT TGTAGGGAGG GGTGACACAG TGTTCCCTCC 
TGGCAGCAAT TAAGGGTCTT CATGTTCTTA TTTTAGGAGA GGCCAAATTG TTTTTTGTCA 
TTGGCGTGCA CACGTGTGTN TGGTGTGTGT GGTGTGTGTG TAAAGGTGTC TTATNAATCT 



TTTTACCCAT TTNNTTACAA TTGCAAGATG ACTGGCTTTA 



340 
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<210> 134 

<211> 99 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (51) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (56) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (59) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (79) 

<223> n equals a,t,g, or c 
<400> 134 

GGAGCCGCGC GCTCTCTCCC GGCGCCCACA CCTGTCTGAG CGGCGCACGA NGCCGNNGNC 
CCGGGCGGGC TGCTCGGCNC GGAACAGTGC TCGGCATGG 



<210> 135 

<211> 161 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (4) 

<223> n equals a,"t,g, or c 
<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (28) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (101) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

'<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (156) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (157) 

<223> n equals a,t,g, or c 
<400> 135 

GCANCGAGCC GNGGCCCGGG CGGGCTGNTC GGCGCGGAAC AGTGCTCCGG CATGGCANTA 
TATTCCAGGG CTCCTCTTCC TTCTGTTCTT TCTGCTTATG NGCTGTTGNG CAAGTAGAGC 
CCTTACAGTT GCCCACTGTA AACCCACTTG GNCTGNNTAC C 



<210> 136 
<211> 442 
<212> PRT 
<213> Homo sapiens 

<400> 136 

Met Ala Ser Val Val Leu Pro Ser Gly Ser Gin Cys Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu 
20 25 30 

Phe Ser Ala Ala Ala Leu He Pro Thr Gly Asp Gly Gin Asn Leu Phe 
3 5 40 45 

Thr Lys Asp Val Thr Val He Glu Gly Glu Val Ala Thr He Ser Cys 
50 55 60 

Gin Val Asn Lys Ser Asp Asp Ser Val He Gin Leu Leu Asn Pro Asn 
65 7 0 75 80 
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Arg Gin Thr He Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg 
85 90 95 

Phe Gin Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr 
100 105 110 

Asn Val Ser He Ser Asp Glu Gly Arg Tyr Phe Cys Gin Leu Tyr Thr 
115 120 125 

Asp Pro Pro Gin Glu Ser Tyr Thr Thr He Thr Val Leu Val Pro Pro 
130 135 140 

Arg Asn Leu Met He Asp He Gin Lys Asp Thr Ala Val Glu Gly Glu 
145 150 155 160 

Glu He Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr 
165 170 175 

He Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val 
180 185 190 

Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gin Leu Met Leu Lys 
195 200 205 

Val His Lys Glu Asp Asp Gly Val Pro Val He Cys Gin Val Glu His 
210 215 220 

Pro Ala Val Thr Gly Asn Leu Gin Thr Gin Arg Tyr Leu Glu Val Gin 
225 230 235 240 

Tyr Lys Pro Gin Val His He Gin Met Thr Tyr Pro Leu Gin Gly Leu 
245 250 255 

Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala He Gly Lys 
260 265 270 

Pro Gin Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro 
275 2.80 285 

Gin His Ala Val Leu Ser Gly Pro Asn Leu Phe He Asn Asn Leu Asn 
290 295 300 

Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn He Val Gly 
305 310 315 320 

Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr 
325 330 335 

He Pro Pro Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
340 345 350 



Thr lie Leu Thr He He Thr Asp Ser Arg Ala Gly Glu Glu Gly Ser 
355 360 365 



He Arg Ala Val Asp His Ala Val lie Gly Gly Val Val Ala Val Val 
370 375 380 
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Val Phe Ala Met Leu Cys Leu Leu He He Leu Gly Arg Tyr Phe Ala 
385 390 395 400 

Arg His Lys Gly Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp Asp 
405 410 415 

Ala Ala Asp Ala Asp Thr Ala He He Asn Ala Glu Gly Gly Gin Asn 
420 425 430 

Asn Ser Glu Glu Lys Lys Glu Tyr Phe He 
435 440 



<210> 137 

<211> 1329 

<212> DNA 

<213> Homo sapiens 

<400> 137 

ATGGCGAGTG TAGTGCTGCC GAGCGGATCC CAGTGTGCGG CGGCAGCGGC GGCGGCGGCG 60 

CCTCCCGGGC TCCGGCTCCG GCTTCTGCTG TTGCTCTTCT CCGCCGCGGC ACTGATCCCC 120 

ACAGGTGATG GGCAGAATCT GTTTACGAAA GACGTGACAG TGATCGAGGG AGAGGTTGCG 180 

ACCATCAGTT GCCAAGTCAA TAAGAGTGAC GACTCTGTGA TTCAGCTACT GAATCCCAAC 240 

AGGCAGACCA TTTATTTCAG GGACTTCAGG CCTTTGAAGG ACAGCAGGTT TCAGTTGCTG 300 

AATTTTTCTA GCAGTGAACT CAAAGTATCA TTGACAAACG TCTCAATTTC TGATGAAGGA 360 

AGATACTTTT GCCAGCTCTA TACCGATCCC CCACAGGAAA GTTACACCAC CATCACAGTC 420 

CTGGTCCCAC CACGTAATCT GATGATCGAT ATCCAGAAAG ACACTGCGGT GGAAGGTGAG 480 

GAGATTGAAG TCAACTGCAC TGCTATGGCC AGCAAGCCAG CCACGACTAT CAGGTGGTTC 540 

AAAGGGAACA CAGAGCTAAA AGGCAAATCG GAGGTGGAAG AGTGGTCAGA CATGTACACT 600 

GTGACCAGTC AGCTGATGCT GAAGGTGCAC AAGGAGGACG ATGGGGTCCC AGTGATCTGC 660 

CAGGTGGAGC ACCCTGCGGT CACTGGAAAC CTGCAGACCC AGCGGTATCT AGAAGTACAG 720 

TATAAGCCTC AAGTGCACAT TCAGATGACT TATCCTCTAC AAGGCTTAAC CCGGGAAGGG 780 

GACGCGCTTG AGTTAACATG TGAAGCCATC GGGAAGCCCC AGCCTGTGAT GGTAACTTGG 840 

GTGAGAGTCG ATGATGAAAT GCCTCAACAC GCCGTACTGT CTGGGCCCAA CCTGTTCATC 900 

AATAACCTAA ACAAAACAGA TAATGGTACA TACCGCTGTG AAGCTTCAAA CATAGTGGGG 960 

AAAGCTCACT CGGATTATAT GCTGTATGTA TACGATCCCC CCACAACTAT CCCTCCTCCC 1020 

ACAACAACCA CCACCACCAC CACCACCACC ACCACCACCA TCCTTACCAT CATCACAGAT 1080 

TCCCGAGCAG GTGAAGAAGG CTCGATCAGG GCAGTGGATC ATGCCGTGAT CGGTGGCGTC 1140 

GTGGCGGTGG TGGTGTTCGC CATGCTGTGC TTGCTCATCA TTCTGGGGCG CTATTTTGCC 1200 
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AGACATAAAG GTACATACTT CACTCATGAA GCCAAAGGAG CCGATGACGC AGCAGACGCA 1260 
GACACAGCTA TAATCAATGC AGAAGGAGGA CAGAACAACT CCGAAGAAAA GAAAGAGTAC 1320 
TTCATCTAG nno 



<210> 138 
<211> 538 
<212> PRT 
<213> Homo sapiens 

<400> 138 

Met Ala Arg Ala Ala Ala Leu Leu Pro Ser Arg Ser Pro Pro Thr Pro 
1 5 io is 

Leu Leu Trp Pro Leu Leu Leu Leu Leu Leu Leu Glu Thr Gly Ala Gin 
20 25 30 

Asp Val Arg Val Gin Val Leu Pro Glu Val Arg Gly Gin Leu Gly Gly 
35 40 45 

Thr Val Glu Leu Pro Cys His Leu Leu Pro Pro Val Pro Gly Leu Tyr 
50 55 60 

lie Ser Leu Val Thr Trp Gin Arg Pro Asp Ala Pro Ala Asn His Gin 
65 70 75 80 

Asn Val Ala Ala Phe His Pro Lys Met Gly Pro Ser Phe Pro Ser Pro 
85 90 95 

Lys Pro Gly Ser Glu Arg Leu Ser Phe Val Ser Ala Lys Gin Ser Thr 
100 105 HO 

Gly Gin Asp Thr Glu Ala Glu Leu Gin Asp Ala Thr Leu Ala Leu His 
115 120 125 

Gly Leu Thr Val Glu Asp Glu Gly Asn Tyr Thr Cys Glu Phe Ala Thr 
130 135 140 

Phe Pro Lys Gly Ser Val Arg Gly Met Thr Trp Leu Arg Val He Ala 
145 150 155 160 

Lys Pro Lys Asn Gin Ala Glu Ala Gin Lys Val Thr Phe Ser Gin Asp 
165 170 175 

Pro Thr Thr Val Ala Leu Cys He Ser Lys Glu Gly Arg Pro Pro Ala 
180 185 190 

Arg He Ser Trp Leu Ser Ser Leu Asp Trp Glu Ala Lys Glu Thr Gin 
195 200 205 

Val Ser Gly Thr Leu Ala Gly Thr Val Thr Val Thr Ser Arg Phe Thr 
210 215 220 

Leu Val Pro Ser Gly Arg Ala Asp Gly Val Thr Val Thr Cys Lys Val 
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225 



230 



235 



240 



Glu His Glu Ser Phe Glu Glu Pro Ala Leu He Pro Val Thr Leu Ser 
245 250 255 

Val Arg Tyr Pro Pro Glu Val Ser He Ser Gly Tyr Asp Asp Asn Trp 
260 265 270 

Tyr Leu Gly Arg Thr Asp Ala Thr Leu Ser Cys Asp Val Arg Ser Asn 
275 280 285 

Pro Glu Pro Thr Gly Tyr Asp Trp Ser Thr Thr Ser Gly Thr Phe Pro 
290 295 300 

Thr Ser Ala Val Ala Gin Gly Ser Gin Leu Val He His Ala Val Asp 
305 310 315 320 

Ser Leu Phe Asn Thr Thr Phe Val Cys Thr Val Thr Asn Ala Val Gly 
325 330 335 

Met Gly Arg Ala Glu Gin Val He Phe Val Arg Glu Thr Pro Asn Thr 
340 345 350 

Ala Gly Ala Gly Ala Thr Gly Gly He He Gly Gly He He Ala Ala 
355 360 365 

He He Ala Thr Ala Val Ala Ala Thr Gly He Leu He Cys Arg Gin 
370 375 380 

Gin Arg Lys Glu Gin Thr Leu Gin Gly Ala Glu Glu Asp Glu Asp Leu 
385 390 395 400 

Glu Gly Pro Pro Ser Tyr Lys Pro Pro Thr Pro Lys Ala Lys Leu Glu 
405 410 415 

Ala Gin Glu Met Pro Ser Gin Leu Phe Thr Leu Gly Ala Ser Glu His 
420 425 430 

Ser Pro Leu Lys Thr Pro Tyr Phe Asp Ala Gly Ala Ser Cys Thr Glu 
435 440 445 

Gin Glu Met Pro Arg Tyr His Glu Leu Pro Thr Leu Glu Glu Arg Ser 
450 455 460 

Gly Pro Leu His Pro Gly Ala Thr Ser Leu Gly Ser Pro He Pro Val 
465 470 475 480 

Pro Pro Gly Pro Pro Ala Val Glu Asp Val Ser Leu Asp Leu Glu Asp 
485 490 495 

Glu Glu Gly Glu Glu Glu Glu Glu Tyr Leu Asp Lys He Asn Pro He 
500 505 510 



Tyr Asp Ala Leu Ser Tyr Ser Ser Pro Ser Asp Ser Tyr Gin Gly Lys 
515 520 525 



Gly Phe Val Met Ser Arg Ala Met Tyr Val 
530 535 
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<210> 139 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (188) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (203) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (342) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (393) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (395) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (431) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (440) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (449) 

<223> n equals a,t ( g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (471) 

<223> n equals a,t,g, or c 



<400> 139 






GACACGAGGC 


AAATCGGAGG TGGAAGAGTG GTCAGACATG TACACTGTGA CCAGTCAGCT 


60 


GATGCTGAAG 


GTGCANAAGG AGGACGATGG GGTCCCAGTG ATCTGCCAGG TGGAGCACCC 


120 


TGCGGTCACT 


GGAAACCTGC AGACCCAGCG GTATCTAGAA GTACAGTATA AGCCTCAAGT 


180 


GCACATTNAG 


ATGACTTATC CTNTACAAGG CTTAACCCGG GAAGGGGACG CGCTTGANTT 


240 


AACATGTGAA 


GCCATCGGGA AGCCCCAGCC TGTGAATGGT AAACTTGGGT GAGAAGTGCG 


300 


ATGATGAAAT 


GCCTCAACAC GCCGTACTGT TTGGGCCCAA CNTGTTNCAT TCATTAACNT 


360 



AAACAAAACA GTTAATGGTA CNTACCGTTG TGNANGTTTC AAACNNAGTG GGGGAAAGTT 



420 
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CATTCGGGTT NTATGCTGTN TGTNTNAGNT TCCCCCCACA ATTTTCCTNC NTCCCACAAC 480 
AACCACCACC ACCACCACC 4g9 



<210> 140 
<211> 245 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (44) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (46) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (154) 

<223> n equals a,t,g, or c 
<400> 140 

CAAGTTCCAA TATCACTGTC TCTTTATCAT CTAAATAGGG CCANTNGGAC ACCTCATTGA 60 
AACAAAAAGG CTGATCTAGA TGAAGTACTC TTTCTTTTCT TCGGAGTTGT TCTGTCCTCC 120 
TTCTGCATTG ATTATAGCTG TGTCTGCGTC TNCNGCGTCA TCGGCTCCTT TGGCTTCATG 180 
AGTGAAGTAT GTACCTTTAT GTCTGGCAAA ATAGCGCCCC AGAATGATGA GCAAGCACAG 240 
CATGG 245 



<210> 141 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (94) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (148) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (316) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (317) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (322) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (324) 

<223> n equals a,t,g, or c 
<400> 141 



GGCANAGGGA 


ACACAGAGCT 


AAAAGGCAAA 


TCGGAGGTGG 


AAGAGTGGTC 


AGACATGTAC 


60 


ACTGTGAACC 


AGTCAGCTGA 


TGCTGAAGGT 


GCANAAGGAG 


GACGATGGGG 


TCCCAGTGAA 


120 


TCTGCCAGGT 


GGAGCACCCT 


GCGGTCANTG 


GAAACCTGCA 


GAGCCAGCGG 


TATCTAGAAG 


180 


TACAGTATAA 


GCCTCAAGTG 


CACATTTCAG 


ATGACTTATC 


CTGTACAAGG 


CTTAACCCGG 


240 


AAGGGGACGC 


GCTTTNAGTT 


TAACATGTGA 


AGCCATCGGG 


GAAGCCCCCA 


GCCTGTGAAT 


300 


GGTAAATTTG 


GGTTANNAGT 


CNGN 








324 



<210> 142 
<211> 468 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (89) 

<223> n equals a,t,g, or c 



<220> 
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<221> SITE 
<222> (226) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (242) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (245) 

<223> n equals a # t,g, or c 
<220> 

<221> SITE 
<222> (298) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (314) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (322) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (343) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (377) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (385) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (414) 

<223> n equals a # t,g, or c 
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<220> 

<221> SITE 
<222> (421) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (426) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (432) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (439) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (442) 

<223> n equals a,t t g, or c 
<220> 

<221> SITE 
<222> (461) 

<223> n equals a,t,g, or c 



<400> 142 



GAATTCGGCA 


GAGCAGGTGC CCGACATGGC GAGTGTACTG CTGCCGAGCG 


GATCCCAGTG 


60 


TGCGGCGGCA 


CGGCGGCGGC GGCGCTGCNC GGGCTCCGGC TCCGGCTTCT 


GCTGTTGCTC 


120 


TTCTCCGCCG 


CGGCACTGAT CCCCACAGGT GATGGGCAGA ATCTGTTTAC 


GAAAGACGTG 


180 


ACAGTGATCG 


AGGGAGAGGT TGCGACCATC AGTTGCCAAG TCAATNAAGA 


GTGACGACTC 


240 


TNTGNATTCA 


GCTACTGAAT CCCAACAGGC AGACCATTTA TTTCAGGGGA 


CTTCAGGNCT 


300 


TTGAAGGACA 


GCANGTTTTC ANTTGCTTGA AATTTTTCTA GCNATTGNAA CTCAAAAGTG 


360 


TTCATTTGAC 


AAACGTNTCA ATTTNTGATG AAGGGAGGTT AATTTTTNGC 


CAANTTTTTT 


420 


NANCCNTNCC 


CNAACAGGNA ANTTACAACC ACCTTTAAAA NTCCTGGT 




468 
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<210> 143 
<211> 279 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (23) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (118) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (190) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (201) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (223) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 



<400> 143 
AAATGCCTCA 


ACACGCCGTA 


CTNTCTGGGC 


CCAACCTGTT 


CATCAATAAC 


CTAAACAAAA 


60 


CAGATAATGG 


TACATACCGC 


TGTGAAGCTT 


CAAACATAGT 


GGGGAAAGCT 


CACTCGGNTT 


120 


ATATGCTGTA 


TGTATACGAT 


CCCCCCACAA 


CTATCCCTCC 


TCCCACAACA 


ACCACCACCA 


180 


CCACCACCAN 


CACCACCACC 


NCCATCCTTA 


CCATCATCAC 


AGNTTCCCGA 


GCAGGTGNAG 


240 


AAGGCTCGAT 


CAGGGCAGTN 


GATCATGCCG 


TGATCGGTG 






279 



<210> 144 
<211> 393 
<212> PRT 
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<213> Homo sapiens 
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<400> 144 

Met Trp Trp Arg Val Leu Ser Leu Leu Ala Trp Phe Pro Leu Gin Glu 
15 10 15 

Ala Ser Leu Thr Asn His Thr Glu Thr He Thr Val Glu Glu Gly Gin 
20 25 30 

Thr Leu Thr Leu Lys Cys Val Thr Ser Leu Arg Lys Asn Ser Ser Leu 
35 40 45 

Gin Trp Leu Thr Pro Ser Gly Phe Thr He Phe Leu Asn Glu Tyr Pro 
50 55 60 

"Ala Leu Lys Asn Ser Lys Tyr Gin Leu Leu His His Ser Ala Asn Gin 
65 70 75 80 

Leu Ser He Thr Val Pro Asn Val Thr Leu Gin Asp Glu Gly Val Tyr 
85 90 95 

Lys Cys Leu His Tyr Ser Asp Ser Val Ser Thr Lys Glu Val Lys Val 
100 105 no 

He Val Leu Ala Thr Pro Phe Lys Pro He Leu Glu Ala Ser Val He 
H5 120 125 

Arg Lys Gin Asn Gly Glu Glu His Val Val Leu Met Cys Ser Thr Met 
130 135 140 

Arg Ser Lys Pro Pro Pro Gin He Thr Trp Leu Leu Gly Asn Ser Met 
145 150 155 160 

Glu Val Ser Gly Gly Thr Leu His Glu Phe Glu Thr Asp Gly Lys Lys 
165 170 175 

Cys Asn Thr Thr Ser Thr Leu He lie Leu Ser Tyr Gly Lys Asn Ser 
180 185 190 

Thr Val Asp Cys He He Arg His Arg Gly Leu Gin Gly Arg Lys Leu 
195 200 205 

Val Ala Pro Phe Arg Phe Glu Asp Leu Val Thr Asp Glu Glu Thr Ala 
210 215 220 

Ser Asp Ala Leu Glu Arg Asn Ser Leu Ser Thr Gin Asp Pro Gin Gin 
225 230 235 240 

Pro Thr Ser Thr Val Ser Val Thr Glu Asp Ser Ser Thr Ser Glu He 
245 250 255 

Asp Lys Glu Glu Lys Glu Gin Thr Thr Gin Asp Pro Asp Leu Thr Thr 
260 265 270 

Glu Ala Asn Pro Gin Tyr Leu Gly Leu Ala Arg Lys Lys Ser Gly He 
275 280 285 



Leu Leu Leu Thr Leu Val Ser Phe Leu He Phe He Leu Phe He He 
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290 295 300 

Val Gin Leu Phe lie Met Lys Leu Arg Lys Ala His Val He Trp Lys 
305 310 315 320 

Arg Glu Asn Glu Val Ser Glu His Thr Leu Glu Ser Tyr Arg Ser Arg 
325 330 335 

Ser Asn Asn Glu Glu Thr Ser Ser Glu Glu Lys Asn Gly Gin Ser Ser 
340 345 350 

Leu Pro Met Arg Cys Met Asn Tyr He Thr Lys Leu Tyr Ser Glu Ala 
355 360 365 

Lys Thr Lys Arg Lys Glu Asn Val Gin His Ser Lys Leu Glu Glu Lys 
370 375 380 

His He Gin Val Pro Glu Ser He Val 
385 390 



<210> 145 
<211> 439 
<212> PRT 
<213> Homo sapiens 

<400> 145 

Met Thr Phe Thr Thr Val Leu His Val Ala Ala Leu Leu Leu Leu Gin 
1 5 10 15 

Gly Asp Phe Pro Gly Ala Gly Ser Glu Thr He Thr Val Gin Glu Gly 
20 25 30 

Glu Asp Leu Asn Leu Arg Cys Thr Phe Thr Gly Asp Ser Arg Ala Thr 
35 40 45 

Lys Gin Trp Leu Asn Pro Arg Gly Phe Thr He Phe Leu Asp Asn His 
50 55 60 

Trp Gly Leu Lys Asp Gin Arg Tyr Arg Leu He His Tyr Ser Glu Asp 
65 70 75 80 

Glu Leu Ser He Arg Leu Ser Asn He Thr Val His Asp Glu Gly Val 
85 90 95 

Tyr Lys Cys Tyr Tyr Tyr Ser Thr Pro Phe Arg Ser Lys Met Thr Thr 
100 105 HO 

Val Glu Val Leu Ala Ala Pro Ser Lys Pro Val Leu Gin Val Ser Arg 
115 120 125 

Asp Thr Glu Gly Arg Val Thr Leu Ser Cys Tyr Thr Gin Gly Cys Lys 
!30 135 140 

Pro Gin Pro Gin He Thr Trp Leu Leu Asp Asn Gly He Gin Leu Pro 
145 150 155 160 

Gly Asp Thr Arg His Lys Leu Glu Ala Asp Gly Lys Lys Trp Thr Thr 
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165 170 175 

Thr Ser Thr Leu Thr Val Leu Ala Tyr Gly Pro Asn Ser Thr Ala Thr 
180 185 190 

Cys Leu Val His His Lys Ala Leu Gly Gly Gly Lys Leu Thr Glu Pro 
195 200 205 

Phe Gin Phe Glu Asp Val Ala Arg Thr Val Ala Asn Thr Thr Pro Val 
210 215 220 

Ser Thr Thr Leu Glu Val Asp Thr Tyr Val Ser Glu Tyr Val Gin Pro 
225 230 235 240 

Thr Val Thr Thr Ala Glu Ser Asp Leu Asn Ser Asn Thr Asp Phe Ser 
245 250 255 

.Pro Ser Tyr Pro Gin His Asn Gly Ser Gly Ala Thr Ser Val Ala Ala 
260 265 270 

Gly Glu Leu Ser Gly Thr Ser Ala His His lie Pro Glu Gly Thr Glu 
275 280 285 

Thr Ala Leu Asn Gly Thr Val Thr Glu Glu Leu Phe Arg Thr Glu Ala 
290 295 300 

Ser Phe Pro Ser Glu Asn Val Thr Leu lie Ser He Val Thr Phe Glu 
305 310 315 320 

Gin Asp Val Lys Ser Glu Gly Met Ser Lys Lys Glu Lys Asp Phe Leu 
325 330 335 

Leu Pro Leu Leu Val Ala Val Leu He Val Met Leu Leu He He Val 
340 345 350 

Val Leu Phe Ser Lys Lys Leu He Lys Pro His Gly Val Trp Lys Arg 
355 360 365 

Glu Asn Asp Thr Ser Asp Gin Thr Leu Glu Ser Tyr Lys Ser Lys Ser 
370 375 380 

Asn Glu Glu Ser Pro Gly His Glu Lys Asn Gly Gin Ala Val Ser Gin 
385 390 395 400 

Lys Pro Asn Met Gin Tyr Val Thr Glu Gly Tyr Val Glu Ala Thr Gin 
405 410 415 

Lys Asn Pro Ser Glu Lys Asn Thr Lys Leu Pro Glu Glu Gin Phe Ala 
420 425 430 

Arg Gly Lys Glu Thr Asp Val 
435 



<210> 146 
<211> 252 
<212> PRT 
<213> Homo sapiens 
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<400> 146 

Ser Arg Asp Gly Phe His Arg Val Ser Gin Asp Gly Leu Asp Leu Leu 
1 5 10 15 

Thr Pro Cys Ser Ser Pro Leu Gly Leu Pro Lys Cys Trp Asp Tyr Arg 
20 25 30 

Gly Asp Pro Pro Arg Pro Val Leu Glu Asp Gly Ser Glu Ser Leu Glu 
35 40 45 

-> 

Tyr Leu Ser Ser Ser Asn Leu Lys Glu Val Leu Ala Cys Arg Gly Ser 
50 55 60 

Leu His Gly Trp Ala Gin Leu Val His Leu Pro Phe Ser Ala Tyr Ala 
65 70 75 80 

Gly Tyr Ser Ser Glu Pro Gly Thr Leu Leu Ser Ala Glu Leu Lys Leu 
85 90 95 

His Thr Met Val Leu Trp Pro Gin Phe Tyr Arg Ser He Leu Tyr Leu 
100 105 HO 

Leu Tyr Trp Leu Leu Arg Gly Arg Arg Asn Asn Thr Lys Pro Lys Pro 
115 120 125 

Phe Cys Cys Asp His Pro Pro Ser Tyr Pro Leu His Phe Arg Leu Tyr 
130 135 140 

Gin Met Glu Lys Thr Leu Ser Gly Asp Val His His Gin Tyr Tyr His 
145 150 155 160 

Gin Asp Phe Ser Arg Lys Tyr Tyr His Pro Gly He Cys Trp Leu Gin 
165 170 175 

Leu Leu Leu Leu Ser Ala Pro Phe His Ser lie Asn Met Leu Arg Glu 
180 185 190 

Phe Ala He Leu Ser Asn lie Leu Met His Ser His Lys Leu Gin Cys 
195 200 205 

Asn Ser Leu Leu Phe Met Tyr Lys Val Arg Asn Leu Cys Leu Leu Pro 
210 215 220 

Cys Trp His Leu Pro Leu Lys Ser Lys Ser Lys Cys Ser Tyr Ala Arg 
225 230 235 240 

Glu Ser Arg Val Thr Leu Phe Tyr Gly Ser Cys Ser 
245 250 



<210> 147 
<211> 369 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> SITE 



I 
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<222> (5) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (80) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (255) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (286) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (301) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (346) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (350) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (368) 

<223> n equals a,t,g, or c 
<400> 147 

GGCANAGTGA CACCAGGGNA GGGAATTAAT TCTCCCAAGG AAAACGCATT GAGTTACATG 60 



AAAAGGGGTC TAGGATTGAN CACTGCACTG AGAAACAACA ACATCTAGAT GATGGATAGG 



120 
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GGAGGAAGAA AGATTTCAGT TAAGGAGAAT GNAAGTTTCT CCTGAAATGA AAGGAAGAAA 180 

ACCAGGATAG TGCCATGCAA AGGAAGGTGT TTCAAAGAGG AGGAAGAGAT TAACAGTGCC 240 

AAATGCGACT NCGANTGTCA AAGAAGTTTG AAAGAAATAA GGGTTNTCCT GGGTAGCCTT 300 

NGTTAGAGTA GTTCTGGTGG AATGGTAGAC ATGGGAGTTA CCTTTNTATN GGTTGGGAAG 360 

GAAAGGGNG ,, Q 
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